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NOVEL THERAPEUTIC MOLECULES AND USES THEREFOR 

The present invention relates generally to novel molecules capable of, inter alia, 
controlling cellular functional activity such as proliferation, differentiation and/or 
5 transcriptional regulation and to genetic sequences encoding same. More particularly, the 
present invention relates to novel members of the ETS family of proteins, referred to 
herein as "ELF5", and to genetic sequences encoding same. The molecules of the present 
invention are useful, for example, in therapy, diagnosis, antibody generation and as a 
screening tool for agents capable of modulating transcriptional events during cellular 
10 functioning such as in tumorigenesis. 

Background of the Inyention 

The ETS family of transcription factors share a conserved DNA binding domain, termed 
15 the *ETS domain*, first identified in the gag-myb-ets fusion protein of avian leukemia 
virus E26 (Nunn et a/., 1983; Watson et al, 1988; Karim et aL, 1990; Gutman and 
Wasylyk, 1991; Seth et aL, 1992). The ETS domain recognises and binds to purine rich 
GGA(A/T) core motifs in the promoters and enhancers of various target genes (Macleod et 
a/., 1992; Wasylyk et al, 1993; Janknecht and Nordheim, 1993; Wemer et al, 1995; 
20 Kodand^ani et aL, 1996). The ETS family does not maintain overall similarity outside 
of the ETS domain, but can be grouped into subfamilies based upon variation within the 
ETS domain, and also by the arrangement and presence of other domains, such as those 
involved in transactivation and sites of phosphorylation (Lautenberger et a/, 1992; 
Wasylyk et al, 1993; Janknecht and Nordheun, 1993). Over 30 ETS gene family 
25 members have been identified in species ranging from sea urchin to hiunan. 

Many ETS factors have been implicated in the control of cellular proliferation and 
tumorigenesis (Seth et aL, 1992; Macleod et a/., 1992; Wasylyk et aL, 1993; Janknecht 
and Nordheim, 1993; Scott et al, 1994a; Mutiiusamy et a/., 1995). ETSl, ETS2, ERG2 
30 and PU.l are proto-oncogenes with mitogenic and transforming activity when 
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overexpressed in fibroblasts (Seth et al., 1989; Seth and Papas, 1990; Hart et al., 1995; 
Moreau-Gachelin et al., 1996). In addition, chromosomal translocations involving ETS 
family members are associated with different human cancers. ERG and ERGB/FLIl are 
ftised to the EWS gene in t(21;22) and t(ll;22) translocations, respectively, in Ewing's 
5 sarcoma and other primitive neuroectodermal tumors (Sorensen et al. , 1994; Ida et al. , 
1995). FEV is ftised to EWS in a subset of Ewing's tumors in t(2;22) (Peter et al., 1997). 
TEL is fused to the platelet-derived growth factor receptor beta (PDGFRp) gene in t(5;12) 
translocations of chronic myelomonocytic leukemia, and to the acute myeloid leukemia 1 
(AMLl) transcription factor gene in t(12;21) translocations of acute lymphoblastic 
10 leukemia (Golub et al., 1994, 1995). Fusion of TEL to the receptor-associated kinase 
JAK2 results in early pre-B acute lymphoid leukemia in t(9;12), and in a typical chronic 
myelogenous leukemia in t(9; 15; 12) (Peelers et al. , 1997). Expression of Spil and Flil 
can be activated by position specific integration of the Friend murine leukemia virus in 
murine erythroleukemias (Ben-David et al., 1991). Also, ETSl, ETS2 and ERG regulate 
15 the expression of metalloproteinase genes, such as stromelysin and collagenase (Buttice 
and Kurkinen, 1993; Buttice et al., 1996; Wasylyk et al., 1991), which are important for 
extracellular matrix degradation concomitant with tumor vascularization (angiogenesis) 
and metastasis. 



ETS factors also have important developmental roles. Pointed P2 and yan play critical 
roles in Drosophila eye development (O'Neill et al., 1994). ETS2 is involved in 
skeletal/cartilage development (Sumarsono et al. , 1996). PU.l null mutation results in 
haematopoietic abnormalities (McKercher et al., 1996), and ETSl is involved in 
transactivation of genes required for T cell fiinction (Muthusamy et al., 1995; Sun et al., 
1995; Thomas et al., 1995; Thomas et al., 1997) and angiogenesis (Wasylyk et al., 1991; 
Vandenbunder et al., 1994; Wemert et al., 1992). 



ETS factors are ahnost all expressed m haematopoietic lineages (Bhat et al., 1989; 
etal., 1990; Kola et al., 1993), and indeed appear to ftinction predominantly in these 
and their related neoplasms. However, the most common solid tumors in humans are 
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carcinomas which arise from the transformation of epithelial cells. Transformed breast 
epithelial cells, for example, have been shown to express ETS family members GABPa, 
PEA3, ELFl, ETSl and ELKl (Scott et aL, 1994b; Delannoy-Courdent et al, 1996), but 
expression of these ETS family members is not restricted to epithelial cells. One ETS 
5 family member, ELF3/ESX/ESE-1/ERT, has recently emerged with epithelial and 

epithelial-cancer specific expression (Tymms et aL, 1997; Chang et a/., 1997; Choi et aL, 
1998; Oettgen et al, 1997). 

In work leading up to the present invention, the inventors have identified and sequenced a 
10 novel member of the ETS family, designated herein "ELF5". 

Summary of the Invention 

Bibliographic details of the publications referred to by author in this specification are 
15 collected at the end of the description. Sequence Identity Numbers (SEQ ID NOs.) for the 
nucleotide and amino acid sequences referred to in the specification are defined following 
the bibliography. A summary of the SEQ ID NOs. is provided before the Examples. 

Throughout this specification and the claims which follow, unless the context requires 
20 otherwise, the word "comprise", and variations such as "comprises" and "comprising", will be 
understood to imply the inclusion of a stated integer or step or group of integers or steps but 
not the exclusion of any other integer or step or group of integers or steps. 

One aspect of the present invention provides an isolated nucleic acid molecule comprising a 
25 nucleotide sequence encoding ELF5 wherein said ELF5 comprises an ETS domain. 

Another aspect of the present invention provides a nucleic acid molecule comprising a 
nucleotide sequence encoding or complementary to a sequence encoding an amino acid 
sequence substantially as set forth in one of SEQ ID NO:2 or 4 or a derivative thereof or 
30 having at least about 45% or greater similarity to one or more of SEQ ID NO: 2 or 4 or a 
derivative thereof. 
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Yet another aspect of the present invention provides a nucleic acid molecule comprising a 
nucleotide sequence encoding or complementary to a sequence encoding an amino acid 
sequence substantially as set forth in SEQ ID NO: 7 or having at least about 45% or greater 
similarity to a sequence comprising the amino acid sequence set forth in SEQ ID NO: 7 or a 
5 derivative thereof 

Still yet another aspect of the present invention contemplates a nucleic acid molecule 
comprising a nucleotide sequence substantially as set forth in one of SEQ ID NO: 1 or 3 or a 
derivative thereof capable of hybridising to one of SEQ ID NO: 1 or 3 under low stringency 
10 conditions at 42°C and v^hich encodes an amino acid sequence corresponding to an amino 
acid sequence set forth in one of SEQ ID NO:2 or 4 or a sequence having at least about 45% 
similarity to one or more of SEQ ED NO: 2 or 4. 

A further aspect of the present invention contemplates a nucleic acid molecule comprising a 
15 nucleotide sequence substantially as set forth in one of SEQ ID NO: 5 or 6 or a derivative 
thereof capable of hybridising to one of SEQ ID NO: 5 or 6 under low stringency conditions 
at 42°C and which encodes an amino acid sequence corresponding to an amino acid sequence 
set forth in SEQ ID NO:7 or a sequence having at least about 45% similarity to SEQ ID 
N0:7, 

20 

Yet another further aspect of the present invention provides a nucleotide sequence 
corresponding to ELF5 is a cDNA sequence comprising a sequence of nucleotides as set 
forth in one of SEQ ID NO: 1 or 3 or is a derivative thereof including a nucleotide sequence 
having similarity to one of SEQ ID NO: 1 or 3 and which encodes an amino acid sequence 
25 corresponding to an amino acid sequence set forth in one of SEQ ID NO:2 or 4 or a 
sequence having at least about 45% similarity to one or more of SEQ ID NO:2 or 4. 

Still yet another further aspect of the present invention provides a nucleotide sequence 
corresponding to ELF5 is a cDNA sequence comprising a sequence of nucleotides as set 
30 forth in one of SEQ ID NO: 5 or 6 or is a derivative thereof including a nucleotide sequence 
having similarity to one of SEQ ID NO: 5 or 6 and which encodes an amino acid sequence 
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corresponding to an amino acid sequence as set forth in SEQ ID NO: 7 or a sequence having 
at least about a 45% similarity to SEQ ID N0:7. 

Another aspect of the present invention contemplates a method of modulating activity of 
5 ELF5 in a mammal, said method comprising administering to said mammal a modulating 
effective amount of an agent for a time and under conditions sufficient to increase or decrease 
ELF5 activity. 

Yet another aspect of the present invention contemplates a method of modulating cellular 
10 functional activity in a mammal said method comprising administering to said manmial an 
effective amount of an agent for a time and under conditions sufficient to modulate the 
expression of a nucleotide sequence encoding ELF5 or sufficient to modulate the activity of 
ELF5. 

15 Still yet another aspect of the present invention contemplates a method of modulating cellular 
functional activity in a mammal said method comprising administering to said manmial an 
effective amount of ELF5 or ELF5, 

A further aspect of the present invention relates to a method of treating a mammal said 
20 method comprising administering to said mammal an effective amount of an agent for a time 
and under conditions sufficient to modulate the expression of ELF5 or sufficient to modulate 
the activity of ELF5 wherein said modulation results in modulation of cellular fimctional 
activity. 

25 Yet another further aspect of the present invention relates to a method of treating a mammal 
said method comprising administering to said mammal an effective amount of ELF5 or ELF5 
for a time and under conditions sufficient to modulate cellular functional activity. 

Still yet another further aspect of the present invention relates to the use of an agent capable 
30 of modulating the expression of ELF5 or modulating the activity of ELF5 in the manufacture 
of a medicament for the modulation of cellular functional activity. 
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Another aspect of the present invention relates to the use of ELF5 or ELF5 in the 
manufacture of a medicament for the modulation of cellular functional activity. 

5 Yet another aspect of the present invention relates to agents for use in modulating ELF5 
expression or ELF5 activity wherein said modulation results in modulation of cellular 
functional activity. 

Still yet another aspect of the present invention relates to ELF5 or ELF5 for use in 
10 modulating cellular functional activity. 

A further aspect of the present invention contemplates a pharmaceutical composition 
comprising ELF5, ELF5 or an agent capable of modulating ELF5 expression or ELF5 
activity together with one or more pharmaceutically acceptable carriers and/or diluents. 
15 ELF5, ELF5 or said agent are referred to as the active ingredients. 

Another further aspect of the present invention contemplates a method for detecting ELF5 or 
ELF5 naRNA in a biological sample from a subject said method comprising contacting said 
biological sample with an antibody specific for ELF5 or ELF5 mRNA or its derivatives or 
20 homologs for a time and under conditions sufficient for an antibody-ELF5 or antibody-ELF5 
mRNA complex to form, and then detecting said complex. 
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Single and three letter abbreviations used throughout the specification are defined in 
Table 1. 

TABLE 1 

5 Single and three letter amino acid abbreviations 



Amino Acid Three-letter One-letter 

Abbreviation Symbol 



Alanine 


Ala 


A 


Arginine 


Arg 


R 


Asparagine 


Asn 


N 


Aspartic acid 


Asp 


D 


Cysteine 


Cys 


C 


Glutamine 


Gin 


Q 


Glutamic acid 


Glu 


E 


Glycine 


Gly 


G 


Histidine 


His 


H 


Isoleucine 


He 


I 


Leucine 


Leu 


L 


Lysine 


Lys 


K 


Methionine 


Met 


M 


Phenylalanine 


Phe 


F 


Proline 


Pro 


P 


Serine 


Ser 


S 


Threonine 


Thr 


T 


Tryptophan 


Trp 


W 


Tyrosine 


Tyr 


Y 


Valine 


Val 


V 


Any residue 


Xaa 


X 
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Brief Description of the Drawings 

Figure 1 is a schematic representation of murine ELF5 cDNA sequence and relationship to 
mRNA transcripts, (a) The nucleotide sequence of murine ELF5 is shown. Breaks in the 

5 sequence indicate the source of sequence data; the central region (92-1528) was sequenced 
from lambda clones, and 5 prime and 3 prime were added from sequencing of RACE PGR 
products. Numbering of the nucleotides, starting with the most 5-prime sequences obtained, 
are indicated on the right. The open reading frame (ORF) is shown in capital letters, with the 
initiating start and stop codons underlined. A stop codon, in the same reading frame as the 

10 ORF, but 5 prime to the initiating codon, is also underlined. The ETS domain is indicated in 
a shaded box. Putative polyadenylation signals are underlined with dashed lines. A/T rich 
tracts in the 3 prime untranslated region are boxed, (b) Northern blot analysis of day 14 
mouse placenta: lane 1, probed with random-prime-labeled 940 bp Styl murine ELF5 cDNA 
fragment (probe 1); lane 2, probed with random-prime-labeled murine £LF5 696 bp 3'- 

15 RACE PCR product (probe 2). Positions of 28S and 18S markers are indicated. Both lanes 
were also probed with GAPDH cDNA (lower panels). 

Figure 2 is a schematic representation of (a) Comparison of human and mouse ORFs. 
Amino acid sequences present in both human and mouse ELF5 are shaded. The ETS domain 

20 is boxed with a solid line and the pointed domain with a dashed line. Putative 

phosphorylation sites, conserved between the two species are circled and labeled as CKII 
(casein kinase 11), PKC (protein kinase C) or TyP (tyrosine kinase) substrates, (b) 
Comparison of the ETS domain of human and mouse ELF5 with those of known members of 
the ETS gene family. The alignment was generated using CLUSTAL W (Thompson et aL, 

25 1994) with the default settings, and the result was subsequently adjusted manually. The ETS 
factors examined are labeled on the left and include hELF3, mELF3, hNERF, dETS4, 
dE74A, hELFl, hELKl, hTEL, hERM, raERSl, mPEA3, mGABP« mERP, dETS6, mPUl, 
hPEl, hSAPl, hSPIB, dYAN, hERG, mFLIl, dELG, dETS3, noETSl, mETS2, mER71, 
where 'h' denotes human, 'm' mouse and *d* Drosophila. The ETS consensus sequence is a 

30 list of the amino acids most often conserved between ETS family members. Shading denotes 
amino acid identity with human ELF5, and the percent identity of each ETS domain is 
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indicated on the right, (c) Phylogenetic tree of the ETS domain produced by maximum 
likelihood analysis. The alignment in Figure 2b was analysed using the JTT-F substitution 
model (Jones et aL, 1992) and local bootstrap values were estimated for all internal branches, 
both by using PROTML in Q mode followed by a second run in R mode (Adachi and 
5 Hasegawa, 1996). An underlying assumption of the phylogenetic analysis is that the amino 
acid content does not vary signijBcantly among the sequences. This assumption was not 
assessed because tools for doing so are still under development (LSJ, unpublished work). 
Therefore, the tree may be the result of both historical and compositional components. The 
four points at which gene duplications have been inferred are marked A, B, C and D. (d) 
10 Comparison of the pointed domain of human and mouse ELF5 with those of other members 
of the ETS family. The ETS factors examined are labeled on the left and include hERG, 
hELF3, hTEL, hGABP«, hETSl, hETS2, dYAN and dPOINTEDP2. Other labels and 
conventions are as described for Figure 2b. 

15 Figure 3 is a schematic representation of the chromosomal localization of human ELF5, 
Human chromosomal localization of ELF5 was performed by PGR using gene specific 
primers and the Genebridge 4 Radiation Hybrid DNA panel (UK HGMP Resource Centre). 
Diagram based upon PCR results (data not shown) showing localization of ELF5 v^thin 
chromosome one, with respect to adjacent marker obtained fi-om mapping data (see text). 

20 

Figure 4 is a photographic representation of ELF5 expression in mouse tissues. Positions of 
28 S and 18S markers are indicated. ELFSa and ELF5b transcripts are indicated, (a) 
Northern analysis of adult mouse tissues probed v^th murine ELF5 cDNA (top panels) and 
GAPDH cDNA (lower panels). Abbreviations; Li: liver; Lu: lung; Br: brain; Ki: kidney; He: 

25 heart; Sm: small intestine; Sp: spleen; Th: thymus; St: stomach; Ov: ovary; Pa: pancreas; To: 
tongue; Sk: skeletal muscle; Bl: bladder; 2Fa: day 2 pregnant fat; 2 Ma: day 2 pregnant 
mammary gland; 10 Fa: day 10 pregnant fat; 10 Ma: day 10 pregnant mammary gland; Co: 
colon. Arrow indicates position of brain specific transcript (see text), (b) Northern analysis 
as above, but using RNA fi"om day 1 neonate mouse tissues. Additional abbreviation; In: 

30 intestine. Arrow indicates position of large transcript (see text), (c) Northern analysis as 
above, but using RNA fi-om day 16, 17 and 19 embryonic tissues, (d) Northern analysis as 
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above, but using RNAfrom day 9.5 to day 19 placental tissues as indicated. 

Figure 5 is a photographic representation of ELF5 expression in human tissues and cell lines, 
(a) Northern analysis of adult human tissues probed with human ELF5 cDNA (top panels) 
5 and p-Actin cDNA (lower panels). The single ELF5 transcript is indicated. Other labels and 
conventions are as for Figure 4. Abbreviations; He: heart; Br: brain; PI: placenta; Lu: lung; 
Li: liver; Sk: skeletal muscle; Ki: kidney; Pa: pancreas; Sp: spleen; Th: thymus; Pr: prostate; 
Te: testis; Ov: ovary; Sm: small intestine; Co: colon mucosa; PBL: peripheral blood 
lymphocytes, (b) RNAse protection analysis of ELF5 and GAPDH in cell lines; 1 : CaOv-3 

10 (ovarian carcinoma); 2: BT-549 (ductal breast carcinoma); 3: ZR-75-1 (breast carcinoma); 4: 
T47D (ductal breast carcinoma, progesterone sensitive); 5: 786-0 (renal adenocarcinoma); 6: 
SK-HEP-1 (liver adenocarcinoma); 7: A549 (lung adenocarcinoma); 8: CCL32SK (primary 
fibroblast); 9: MEL28 (melanoma); 10: WISH (amnion carcinoma); 11: Jurkat (T cell 
leukemia); 12: DU145 (prostate carcinoma); 13: PC3 (prostate carcinoma); 14: HEC-1 

15 (endometrium carcinoma); 15: K562 (erythroid leukemia), (c) Southern analysis of ELF5 in 
Bgin digested genomic DNA firom cell lines; 1: normal blood; 2: BT-549 (ductal breast 
carcinoma); 3: ZR-75-1 (breast carcinoma); 4: T47D (ductal breast carcinoma); 5: NCI- 
HI 299 (large cell lung carcinoma); 6: NCI-HI87 (small cell lung carcinoma); 7: NCI-H322 
(bronchioalveolar carcinoma); 8: NCI-H358 (bronchioalveolar carcinoma); 9: NCI-H522 

20 (lung adenocarcinoma); 10: SK-LU-1 (lung adenocarcinonfia); 11: NCI-H441 

(bronchioalveolar carcinoma); 12: NCI-H460 (large cell lung carcinoma); 13: NCI-H661 
(large cell lung carcinoma). 

Figure 6 is a photographic representation of ELF5 binding to consensus ETS binding 
25 sequences, (a) His-tagged ELF5 recombinant protein, present in E. coli lysates (lane 2), was 
purified by metal-affinity chromatography to approximately 90% (lane 3) and eluted with 
imadazole (lane 4). (b) Specific DNA binding of Elf5 was analysed by electrophoretic 
mobility shift assay (EMSA), using labeled double-stranded oligonucleotides as probes. E74 
conteiins a consensus binding site for ETS family members (lane 1). E74ml is a mutant 
30 oligonucleotide based on E74, but with the core GGAA replaced by AGAA (lane 2). Binding 
to other consensus ETS sites was analysed by the ability of a 100-fold excess of unlabeled 
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double-Stranded oligonucleotide to compete with E74 for Elf5 binding GMETS contains an 
ETS binding site from the human GM-CSF promoter (lane 6). ERBB2 contains an ETS 
binding site from the human erbB2/HER2 promoter (lane 7). MS V contains an ETS binding 
site present in the long terminal repeat of the Moloney sarcoma virus (lane 8). API contains 
5 a consensus API binding site used as a negative control ELF5-DNA complexes are marked. 
Binding of ETS 1 to E74 was used as a positive control (lane 10). 

Figure 7 is a graphical representation of transactivation by ELF5. COS cells were co- 
transfected with CAT reporter and EI£5 expression constructs. Transcription of the CAT 

10 gene was driven by the thymidine kinase (tk) minimal promoter with five copies of the 
polyomavirus enhancer inserted upstream (pSXpoly). The polyomavirus enhancer contains 
adjacent ETS and API binding sites. The ELF5 sense construct (pBOSElfSas) was designed 
to express ELF5 protein, and the ELF5 anti-s construct (pBOSElfSas) to produce anti-sense 
transcripts. In the absence of expression construct the equivalent amount of base vector 

15 (pEFBOS) was co-transfected. COS cells were processed for CAT assays and the results of 
at least four replicates are shown as the mean with standard error of the mean (s.e.m.) bars. 
Statistically significant results are indicated by asterisks. A single asterisk indicates moderate 
significance (0.05>P>0.01) and triple asterisks indicate very high significance (P<0.001). 

20 Figure 8 is a photographic representation of breast tissue sections from parafin-embedded 
samples which had been hybridized with ELF5 antisense RNA. 



P:\OPER\TDO\ELF5.PRV . 27/8/98 



- 12- 

Detailed Description of the Invention 

The present invention is predicated, in part, on the identification of a novel member of the 
ETS family of molecules, termed ELF5. The identification of this novel molecule permits the 
5 identification and rational design of a range of products for use in therapy, diagnosis and 
antibody generation involving, for example, regulation of cellular functional activity such as 
cellular proliferation. These therapeutic molecules may also act as either antagonists or 
agonists of ELF5 fimction and will be usefiil, inter alia, in cancer and autoimmune disease 
therapy. 

10 

Accordingly, one aspect of the present invention provides an isolated nucleic acid molecule 
comprising a nucleotide sequence encoding ELF5 wherein said ELF5 comprises an ETS 
domain. 

1 5 Reference to an "ETS domain" should be understood as a reference to a protein domain 
which recognises and binds to a purine rich GGA(A/T) motif of a promoter or enhancer 
(Macleod et aL, 1992; Wasylyk et al., 1993; Janknecht and Nordheim, 1993; Werner ei aL, 
1995; Kodandapani et al., 1996). The ETS domain may be continuous, meaning that it is 
comprised of a continuous sequence of antiino acids, or it may be discontinuous, meaning that 

20 it is comprised of individual amino acids or sequences of amino acids fi*om two or more 
separate regions of the protein and which are brought into proximity with one another to 
form the ETS domain due to the secondary, tertiary or quaternary structure of the protein. 

More p^articularly, the present invention provides a nucleic acid molecule comprising a 
25 nucleotide sequence encoding or complementary to a sequence encoding an amino acid 
sequence substantially as set forth in one of SEQ ID NO:2 or 4 or a derivative thereof or 
having at least about 45% or greater similarity to one or more of SEQ ID NO:2 or 4 or a 
derivative thereof. 

30 Another aspect of the present invention provides a nucleic acid molecule comprising a 
nucleotide sequence encoding or complementary to a sequence encoding an amino acid 
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sequence substantially as set forth in SEQ ED NO:7 or having at least about 45% or greater 
similarity to a sequence comprising the amino acid sequence set forth in SEQ ID NO: 7 or a 
derivative thereof 

5 The term "similarity" as used herein includes exact identity between compared sequences 
at the nucleotide or amino acid levels. Where there is non-identity of the nucleotide level 
"similarity" includes differences between sequences which result in different amino acids 
that are nevertheless related to each other at the structural, functional, biochemical and/or 
conformational levels. Where there is non-identity at the amino acid level, "similarity" 
10 includes amino acids that are nevertheless related to each other at the structural, 

functional, biochemical and/or conformational levels. The percentage similarity may be 
greater than 50% such as at least 70% or at least 80% or at least 90% or higher. 

Another aspect of the present invention contemplates a nucleic acid molecule comprising a 
1 5 nucleotide sequence substantially as set forth in one of SEQ ED NO: 1 or 3 or a derivative 
thereof capable of hybridising to one of SEQ ID NO: 1 or 3 under low stringency conditions 
at 42°C and which encodes an amino acid sequence corresponding to an amino acid sequence 
set forth in one of SEQ ED NO:2 or 4 or a sequence having at least about 45% similarity to 
one or more of SEQ ID NO:2 or 4. 

20 

More particularly, the present invention contemplates a nucleic acid molecule comprising a 
sequence of nucleotides substantially as set forth in SEQ ID NO: 1 or 3. 

Another aspect of the present invention contemplates a nucleic acid molecule comprising a 
25 nucleotide sequence substantially as set forth in one of SEQ ID NO: 5 or 6 or a derivative 
thereof capable of hybridising to one of SEQ ID NO: 5 or 6 under low stringency conditions 
at 42°C and which encodes an amino acid sequence corresponding to an amino acid sequence 
set forth in SEQ ID NO: 7 or a sequence having at least about 45% similarity to SEQ ID 
NO:7. 

30 

More particularly, the present invention contemplates a nucleic acid molecule comprising a 
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sequence of nucleotides substantially as set forth in SEQ ID NO: 5 or 6. 

Reference herein to a low stringency at 42''C includes and encompasses from at least about 
1% v/v to at least about 15% v/v formamide and from at least about IM to at least about 2M 
salt for hybridisation, and at least about IM to at least about 2M salt for washing conditions. 
Alternative stringency conditions may be applied where necessary, such as medium 
stringency, which includes and encompasses from at least about 16% v/v to at least about 
30% v/v formamide and from at least about 0.5M to at least about 0.9M salt for 
hybridisation, and at least about 0.5M to at least about 0.9M salt for washing conditions, or 
high stringency, which includes and encompasses from at least about 3 1% v/v to at least 
about 50% v/v formamide and from at least about 0.0 IM to at least about 0.1 5M salt for 
hybridisation, and at least about 0,0 IM to at least about 0.1 5M salt for washing conditions. 
In general, washing is carried out at T„ = 69.3 + 0,41 (G + C) % [19] = -12X. However, the 
T„ of a duplex DNA decreases by PC with every increase of 1% in the number of 
mismatched based pairs (20). 

The nucleic acid molecule according to this aspect of the present invention corresponds 
herein to "ELF5'\ This gene has been determined in accordance with the present invention to 
encode a protein which displays specific binding to DNA sequences comprising a GGA(A/T) 
20 core. The product of the ELF5 gene is referred to herein as ELF5. ELF5 is defined as 

belonging to the ETS family of transcription factors due to its expression of an ETS domain 
which recognises and binds the purine rich GGA(A/T) core motifs. ELF5 is a protein for 
which splice variants exist, thereby resulting in the expression of a variety of isoforms. 
Human ELF5 and human ELF5 short transcript are examples of 2 isoforms which differ in 
25 size due to the splicing out of exon regions from the ELF5 mRNA molecule encoding the 
ELF5 short transcript. Murine ELF5a and ELFSb are examples of 2 mRNA transcripts 
which differ in the length of the 3' untranslated region. Human ELF5 and ELF5 short 
transcript are defined by the amino acid sequences set forth in SEQ ED NO: 2 and 4, 
respectively and murine ELF5 is defined by the amino acid sequence set forth in SEQ ID 
30 NO:7. The cDNA nucleotide sequences for human ELF5 and ELF5 short transcript are 
defined by the nucleotide sequences set forth in SEQ ID NO: 1 and 3, respectively, and 
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murine ELF5a and ELFb are defined by the nucleotide sequences set forth in SEQ ED NO: 5 
and 6, respectively. 

The nucleic acid molecules encoding ELF5 are preferably a sequence of deoxyribonucleic 
5 acids such as cDNA sequences or genomic sequences. A genomic sequence may also 
comprise exons and introns. A genomic sequence may also include a promoter region or 
other regulatory region. 

Reference hereinafter to "ELF5" and "£LF5" should be understood as a reference to all forms 
10 of ELF5 and ELF5, respectively, including by way of example the two mRNA transcripts, 
ELFSa and ELF5b, observed in the mouse. Without limiting the invention in any way, 
sequence analysis of murine ELF5 has revealed two discrete polyadenylation signals present 
in the 3' untranslated region (UTR). The first of these (at 1391 bp) appears to be an 
overlapping poIy(A)'*' recognition signal AATTAA and ATTAAAA. The second is a 
15 consensus polyadenylation signal, AATAAA, at 2181 bp. Sequence analysis of human £LF5 
has also revealed two mRNA transcripts arising fi^om the splicing out of part of the exon 
region. Accordingly, the present invention should be understood to extend to all cDNA and 
peptide isoforms arising fi*om alternative splicing of ELF5 mRNA. 

20 The protein and/or gene is preferably fi^om a human, primate, livestock animal (eg. sheep, pig, 
cow, horse, donkey) laboratory test animal (eg. mouse, rat, rabbit, guinea pig) companion 
animal (eg. dog, cat), captive wild animal (eg. fox, kangaroo, deer), aves (eg. chicken, geese, 
duck, emu, ostrich), reptile or fish. 

25 The term "protein" should be understood to encompass peptides, polypeptides and proteins. 
The protein may be glycosylated or unglycosylated, phosphorylated or dephosphorylated to 
various degrees and/or may contain a range of other molecules fiised, linked, bound or 
otherwise associated to the protein such as amino acids, lipids, carbohydrates or other 
peptides, polypeptides or proteins. Reference hereinafer to a "protein" includes a protein 

30 comprising a sequence of amino acids as well as a protein associated with other molecules 
such as amino acids, lipids, carbohydrates or other peptides, poljrpeptides or proteins. 
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Derivatives include fragments, parts, portions, chemical equivalents, mutants, homologs, 
mimetics from natural, synthetic or recombinant sources including fusion proteins. 
Derivatives may be derived from insertion, deletion or substitution of amino acids. 
Amino acid insertional derivatives include amino and/or carboxylic terminal fusions as 
5 well as intrasequence insertions of single or multiple amino acids. Insertional amino acid 
sequence variants are those in which one or more amino acid residues are introduced into 
a predetermined site in the protein although random insertion is also possible with suitable 
screening of the resulting product. Deletional variants are characterized by the removal of 
one or more amino acids from the sequence. Substitutional amino acid variants are those 
10 in which at least one residue in the sequence has been removed and a different residue 
inserted in its place. Additions to amino acid sequences including fusions with other 
peptides, polypeptides or proteins. 

The derivatives of ELF5 include fragments having particular epitopes or parts of the entire 
15 ELF5 protein fused to peptides, polypeptides or other proteinaceous or non-proteinaceous 
molecules. For example, ELF5 or derivative thereof may be fused to a molecule to 
facilitate its entry into a cell. Analogs of ELF5 contemplated herein include, but are not 
limited to, modification to side chains, incorporating of unnatural amino acids and/or their 
derivatives during peptide, polypeptide or protein synthesis and the use of crosslinkers and 
20 other methods which impose conformational constraints on the proteinaceous molecules or 
their analogs. Derivatives of nucleic acid sequences may similarly be derived from single 
or multiple nucleotide substitutions, deletions and/or additions including fusion with other 
nucleic acid molecules. The derivatives of the nucleic acid molecules of the present 
invention include oligonucleotides, PGR primers, antisense molecules, molecules suitable 
25 for use in co suppression and fusion of nucleic acid molecules. 

Examples of side chain modifications contemplated by the present invention include 
modifications of amino groups such as by reductive alkylation by reaction with an 
aldehyde followed by reduction with NaBH4; amidination with methylacetimidate; 
30 acylation with acetic anhydride; carbamoylation of amino groups with cyanate; 
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trinitrobenzylation of amino groups with 2, 4, 6-trinitrobenzene sulphonic acid (TNBS); 
acylation of amino groups with succinic anhydride and tetrahydrophthalic anhydride; and 
pyridoxylation of lysine with pyridoxal-5-phosphate followed by reduction with NaBH4. 

5 The guanidine group of arginine residues may be modified by the formation of 

heterocyclic condensation products with reagents such as 2,3-butanedione, phenylglyoxal 
and glyoxal. 

The carboxyl group may be modified by carbodiimide activation via O-acylisourea 
10 formation followed by subsequent derivitisation, for example, to a corresponding amide. 

Sulphydryl groups may be modified by methods such as carboxymethylation with 
iodoacetic acid or iodoacetamide; performic acid oxidation to cysteic acid; formation of a 
mixed disulphides with other thiol compounds; reaction with maleimide, maleic anhydride 
15 or other substituted maleimide; formation of mercurial derivatives using 4- 

chloromercuribenzoate, 4-chloromercuriphenylsulphonic acid, phenylmercury chloride, 2- 
chloromercuri-4-nitrophenol and other mercurials; carbamoylation with cyanate at alkaline 
pH. 

20 Tryptophan residues may be modified by, for example, oxidation with N- 

bromosuccinimide or alkylation of the indole ring with 2-hydroxy-5-nitrobenzyl bromide 
or sulphenyl halides. Tyrosine residues on the other hand, may be altered by nitration 
with tetranitromethane to form a 3-nitrotyrosine derivative. 

25 Modification of the imidazole ring of a histidine residue may be accomplished by 
alkylation with iodoacetic acid derivatives or N-carboethoxylation with 
diethy Ipy rocarbonate . 

Examples of incorporating unnatural amino acids and derivatives during protein synthesis 
30 include, but are not limited to, use of norleucine, 4-amino butyric acid, 4-amino-3- 
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hydroxy-5-phenylpentanoic acid, 6-aminohexanoic acid, t-butylglycine, norvaline, 
phenylglycine, omidiine, sarcosine, 4-amino-3-hydroxy-6-methylheptanoic acid, 2-diienyl 
alanine and/or D-isomers of amino acids. A list of unnatural amino acid contemplated 
herein is shown in Table 2. 
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TABLE2 



Non-conventional 
amino acid 

5 


Code 


Non-conventional 
amino acid 


Code 


a-aminobutyric acid 


Abu 


L-N-methylalanine . 


Nmala 


cc-amino-a-methylbutyrate 


Mgabu 


L-N-methylarginine 


Nmarg 


aminocyclopropane- 


Cpro 


L-N-methylasparagine 


Nmasn 


carboxylate 




L-N-methylaspartic acid 


Nmasp 


10 aminoisobutyric acid 


Aib 


L-N-methylcysteine 


Nmcys 


aminonorbomyl- 


Norb 


L-N-methylglutamine 


Nmgln 


carboxylate 




L-N-methylglutamic acid 


Nmglu 


cyclohexylalanine 




Chexa L-N-methylhistidine 


Nmhis 


cyclopentylalanine 


Cpen 


L-N-methylisoUeucine 


Nmile 


15 D-alanine 


Dai 


L-N-methylleucine 


Nmleu 


D-arginine 


Darg 


L-N-methyllysine 


Nmlys 


D-aspartic acid 


Dasp 


L-N-methylmethionine 


Nmmet 


D-cysteine 


Dcys 


L-N-methybiorleucine 


Nmnle 


D-glutamine 


Dgln 


L-N-methylnorvaline 


Nmnva 


20 D-glutamic acid 


Dglu 


L-N-methylomithine 


Nmom 


D-histidine 


Dhis 


L-N-methylphenylalanine 


Nmphe 


D-isoleucine 


Dile 


L-N-methylproline 


Nmpro 


D-leucine 


Dleu 


L-N-methylserine 


Nmser 


D-lysine 


Diys 


L-N-methylthreonine 


Nmthr 


25 D-methionine 


Dmet 


L-N-methyltryptophan 


Nmtrp 


D-omithine 


Dom 


L-N-methyltyrosine 


Nmtyr 


D-phenylalanine 


Dphe 


L-N-methylvaline 


Nmval 


D-proline 


Dpro 


L-N-methylethylglycine 


Nmetg 


D-serine 


Dser 


L-N-methyl-t-butylglycine 


Nmtbug 


30 D-threonine 


Dthr 


L-norleucine 


Nle 
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D-tryptophan 


Dtrp 


L-norv aline 


Nva 


D-tyrosine 


Dtyr 


a-methyl-aminoisobutyrate 


Maib 


D-valine 


Dval 


a-methyl-y-aminobutyrate 


Mgabu 


D-a-methylalanine 


Dmala 


a-methylcyclohexylalanine 


Mchexa 


5 D-a-methylarginine 


Dmarg 


a-methylcylcopentylalanine 


Mcpen 


D-cc-methylasparagine 


Dmasn 


a-methyl-a-napthylalanine 


Manap 


D-a-methylaspartate 


Dmasp 


a-methylpenicillamine 


Mpen 


D-a-methylcysteine 


Dmcys 


N-(4-aminobutyl)glycine 


Nglu 


D-a-methylglutamine 


Dmgln 


N-(2-aminoethyl)glycine 


Naeg 


10 D-a-methylhistidine 


Dmhis 


N-(3-aminopropyl)glycine 


Nom 


D-a-methylisoleucine 


Dmile 


N-amino-a-methylbutyrate 


Nmaabu 


D-a-methylleucine 


Dmleu 


a-nq)thylalanine 


An^ 


D-a-methyllysine 


Dmlys 


N-benzylglycine 


Nphe 


D-a-methylmethionine 


Dmmet 


N-(2-carbamylethyl)glycine 


Ngln 


15 D-a-methylomithine 


Dmom 


N-(carbamyhnethyl)glycine 


Nasn 


D-a-mediylphenylalanine 


Dmphe 


N-(2-carboxyethyl)glycine 


Nglu 


D-a-methylproline 


Dmpro 


N-(carboxymethyl)glycine 


Nasp 


D-a-methylserine 


Dmser 


N-cyclobutylglycine 


Ncbut 


D-a-methylthreonine 


Dmthr 


N-cycloheptylglycine 


Nchep 


20 D-a-methyltryptophan 


Dmtrp 


N-cyclohexylglycine 


Nchex 


D-a-methyltyrosine 


Dmty 


N-cyclodecylglycine 


Ncdec 


D-a-methyivaline 


Dmval 


N-cylcododecylglycine 


Ncdod 


D-N-methylalanine 


Dnmala 


N-cyclooctylglycine 


Ncoct 


D-N-mettiylarginine 


Dnmarg 


N-cyclopropylglycine 


Ncpro 


25 D-N-metiiylasparagine 


Dnmasn 


N-cycloundecylglycine 


Ncund 


D-N-methylaspartate 


Dnmasp 


N-(2,2-diphenylethyl)glycine 


Nbhm 


D-N-mettiylcysteine 


Dnmcys 


N-(3 ,3-dipheny IpropyOglycine 


Nbhe 


D-N-methylglutamine 


Dnmgln 


N-(3-guanidinopropyl)glycine 


Narg 


D-N-methylglutamate 


Dnmglu 


N-( 1 -hydroxyethy Oglycine 


Nthr 


30 D-N-methylhistidine 


Dnmhis 


N-(hydroxyethyl))glycine 


Nser 
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D-N-methylisoleucine 


Dnmile 


N-(imida2olylethyl))glycine 


Nhis 


D-N-methylleucine 


Dnmleu 


N-(3-indolylyethyl)glycine 


Nhtrp 


D-N-methyllysine 


Dnmlys 


N-methyl-Y-aminobutyrate 


Nmgabu 


N-methylcyclohexylalanine 


Nmchexa 


D-N-methylmethionine 


Dnmmet 


5 D-N-methylomithine 


Dnmom 


N-methylcyclopentylalanine 


Nmcpen 


N-methylglycine 


Nala 


D-N-methylphenylalanine 


Dnmphe 


N-methylaminoisobutyrate 


Nmaib 


D-N-methylproline 


Dmnpro 


N-( 1 -methy Ipropy Oglycine 


Nile 


D-N-methylserine 


Dnmser 


N-(2-methyIpropyl)glycine 


Nleu 


D-N-methylthreonine 


Dmnthr 


10 D-N-methyltryptophan 


Dnmtrp 


N-( 1 -methy lethy l)gly cine 


Nval 


D-N-methyltyrosine 


Dnmtyr 


N-methyla-n^thylalanine 


Nmanap 


D-N-methylvaline 


Diunval 


N-methylpenicillamine 


Nmpen 


Y-aminobutyric acid 


Gabu 


N-(p-hydroxyphenyl)glycine 


Nhtyr 


L-r-butylglycine 


Tbug 


N-(thiomethyl)glycine 


Ncys 


15 L-ethylglycine 


Etg 


penicillamine 


Pen 


L-homophenylalanine 


Hphe 


L-a-methylalanine 


Mala 


L-a-methylarginine 


Marg 


L-a-methylasparagine 


Masn 


L-a-methylaspartate 


Masp 


L-a-methyl-f-butylglycine 


Mtbug 


L-a-methylcysteine 


Mcys 


L-methylediylglycine 


Metg 


20 L-a-methylglutamine 


Mgln 


L-a-methylglutamate 


Mglu 


L-a-methylhistidine 


Mhis 


L-a-methylhomophenylalanine 


Mhphe 


L-a-methylisoleucine 


Mile 


N-(2-metiiylthioethyl)glycine 


Nmet 


L-a-methylleucine 


Mleu 


L-a-methyllysine 


Mlys 


L-a-methylmethionine 


Mmet 


L-a-methylnorleucine 


Mnle 


25 L-a-methylnorvaline 


Mnva 


L-a-methylomithine 


Mom 


L-a-methylphenylalanine 


Mphe 


L-a-methylproline 


Mpro 


L-a-methylserine 


Mser 


L-a-methyltiireonine 


Mthr 


L-a-methyltryptophan 


Mtrp 


L-a-methyltyrosine 


Mtyr 


L-a-methylvaline 


Mval 


L-N-methylhomophenyialanine 


Nmhphe 


30 N-(N-(2,2-diphenylethyl) 


Nnbhm 


N-(N-(3 , 3-dipheny Ipropy 1) 


Nnbhe 
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carbamylmethyOglycine carbamylmethyl)glycine 

1 -carboxy- 1 -(2, 2-dipheny 1- Nmbc 

ethylamino)cyclopropane 



5 

Crosslinkers can be used, for example, to stabilise 3D confonnations, using homo- 
bifunctional crosslinkers such as the bifunctional imido esters having (CH2)n spacer 
groups with n=l to n=6, glutaraldehyde, N-hydroxysuccinimide esters and hetero- 
bifunctional reagents which usually contain an amino-reactive moiety such as N- 

10 hydroxy succinimide and another group specific-reactive moiety such as maleimido or 
dithio moiety (SH) or carbodiimide (COOH). In addition, peptides can be 
conformationally constrained by, for example, incorporation of Q and N^-methylamino 
acids, introduction of double bonds between C„ and Cp atoms of amino acids and the 
formation of cyclic peptides or analogues by introducing covalent bonds such as forming 

15 an amide bond between the N and C termini, between two side chains or between a side 
chain and the N or C terminus. 

The nucleic acid molecule of the present invention is preferably in isolated form or 
ligated to a vector, such as an expression vector. By "isolated" is meant a nucleic acid 

20 molecule having undergone at least one purification step and this is conveniently defmed, 
for example, by a composition comprising at least about 10% subject nucleic acid 
molecule, preferably at least about 20%, more preferably at least about 30%, still more 
preferably at least about 40-50%, even still more preferably at least about 60-70%, yet 
even still more preferably 80-90% or greater of subject nucleic acid molecule relative to 

25 other components as determined by molecular weight, encoding activity, nucleotide 

sequence, base composition or other convenient means. The nucleic acid molecule of the 
present invention may also be considered, in a preferred embodiment, to be biologically 
pure. 



30 
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In a particularly preferred embodiment, the nucleotide sequence corresponding to ELF5 is a 
cDNA sequence comprising a sequence of nucleotides as set forth in one of SEQ ID NO: 1 
or 3 or is a derivative thereof including a nucleotide sequence having similarity to one of 
SEQ ID NO: 1 or 3 and which encodes an amino acid sequence corresponding to an amino 
5 acid sequence set forth in one of SEQ ID NO: 2 or 4 or a sequence having at least about 
45% similarity to one or more of SEQ ED NO:2 or 4. 

In another particularly preferred embodiment, the nucleotide sequence corresponding to 
ELF5 is a cDNA sequence comprising a sequence of nucleotides as set forth in one of SEQ 
10 ID NO: 5 or 6 or is a derivative thereof including a nucleotide sequence having similarity to 
one of SEQ ID NO: 5 or 6 and which encodes an amino acid sequence corresponding to an 
amino acid sequence as set forth in SEQ ID NO: 7 or a sequence having at least about a 45% 
similarity to SEQ ID NO:7. 

15 A derivative of the nucleic acid molecule of the present invention also includes nucleic acid 
molecules capable of hybridising to the nucleotide sequences as set forth in one of SEQ ID 
NO: 1 or 3 or SEQ ID NO: 5 or 6 under low stringency conditions. Preferably said low 
stringency is at 42°C. 

20 The nucleic acid molecule may be ligated to an expression vector capable of expression in a 
prokaryotic cell (eg. E, coli) or a eukaryotic cell (eg. yeast cells, fungal cells, insect cells, 
mammalian cells or plant cells). The nucleic acid molecule may be ligated or fused or 
otherwise associated with a nucleic acid molecule encoding another entity such as, for 
example, a signal peptide, a cytokine or other member of the ETS family. 

25 

The expression product is ELF5 having an amino acid sequence set forth in one of SEQ ID 
NO: 2 or 4 or SEQ ID NO: 7 or is a derivative or homologue as hereinbefore defined or is a 
mammalian homologue having an amino acid sequence of at least about 45% similarity to 
the amino acid sequence set forth in one of SEQ ID NO: 2 or 4 or SEQ ID NO: 7 or 
30 derivative or homologue thereof. 
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The ELF5 of the present invention may be in multimeric form meaning that two or more 
molecules are associated together. Where the same ELF5 molecules are associated 
together, the complex is a homomultimer. An example of a homomultimer is a 
homodimer. Where at least one ELF5 is associated with at least one non-ELF5 
5 molecule, then the complex is a heteromultimer such as a heterodimer. A 

heteromultimer may include a molecule of another member of the ETS family or other 
molecule capable of modulating transcription. 

In accordance with the present invention, it is proposed that ELF5 is a molecule which 
10 regulates cellular functional activity. Reference to cellular "functional activity" should be 
understood as a reference to the functions which a cell is capable of performing such as, but 
in no way limited to, one or more of proliferation, differentiation, cell surface molecule 
expression, antigen presentation, maintenance of viability, apoptosis, metabolism, signal 
transduction and molecular mechanisms such as transcription and translation. Without 
1 5 limiting this invention to any one theory or mode of action, human ELF5 has been mapped 
to human chromosome lp36,31 which is a region that frequently undergoes loss of 
heterozygocity in several types of carcinoma, including breast and colon carcinomas. The 
expression pattern of ELF5 and ELF5 in normal and diseased tissues also supports a role for 
these molecules in the regulation of cellular functional activity and, in particular, in the direct 
20 or indirect regulation of tumorigenesis. Even more particularly, it is proposed that ELF5 
functions as a transcription factor. 

The cloning and sequencing of this gene and its expression product now provides an 
additional gene for use in the prophylactic and therapeutic treatment of diseases such as 

25 those involving aberrant cellular functional activity such as aberrant cellular proliferation. 
Examples of diseases involving aberrant cellular proliferation include diseases caused by 
excessive cellular proliferation, such as in tumorigenesis, or diseases caused by inadequate 
cellular proliferation. Accordingly, the present invention contemplates therapeutic and 
prophylactic uses of ELF5 amino acid and nucleic acid molecules, in addition to ELF5 

30 agonistic and antagonistic agents, for the regulation of cellular functional activity, such as 
for example, regulation of proliferation, differentiation and/or regulation of gene expression 
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by transcriptional regulation. 

The present invention contemplates, therefore, a method for modulating expression of ELF5 
in a subject, said method comprising contacting ELF5 gene with an eflfective amount of an 
5 agent for a time and under conditions sufficient to up-regulate or down-regulate or 

otherwise modulate expression of ELF5, For example, ELF5 antisense sequences such as 
ohgonucleotides may be introduced into a cell to down-regulate one or more specific 
functional activities of that cell. Conversely, a nucleic acid molecule encoding ELF5 or a 
derivative thereof may be introduced to up-regulate one or more specific functional 
10 activities of any cell not expressing the endogenous ELF5 gene. 

Another aspect of the present invention contemplates a method of modulating activity of 
ELF5 in a mammal, said method comprising administering to said manmial a modulating 
eflfective amount of an agent for a time and under conditions suflScient to increase or 
1 5 decrease ELF5 activity: 

Modulation of said activity by the administration of an agent to a mammal can be achieved 
by one of several techniques, including but in no way limited to introducing into said 
mammal a proteinaceous or non-proteinaceous molecule which: 

20 

(i) modulates expression of ELF5\ 

(ii) functions as an antagonist of ELF5; 

25 (iii) functions as an agonist of ELF5. 

Said proteinaceous molecule may be derived fi*om natural or recombinant sources including 
fusion proteins or following, for example, natural product screening. Said non- 
proteinaceous molecule may be, for example, a nucleic acid molecule or may be derived 
30 firom natural sources, such as for example natural product screening or may be chemically 
synthesised. The present invention contemplates chemical analogs of ELF5 capable of 
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acting as agonists or antagonists of ELF5. Chemical agonists may not necessarily be derived 
from ELF5 but may share certain conformational similarities. Alternatively, chemical 
agonists may be specifically designed to mimic certain physiochemical properties of ELF5. 
Antagonists may be any compound capable of blocking, inhibiting or otherwise preventing 
5 ELF5 from carrying out its normal biological functions. Antagonists include monoclonal 
antibodies specific for ELF5, or parts of ELF5, and antisense nucleic acids which prevent 
transcription or translation of ELF5 genes or mRNA in mammalian cells. 

Said proteinaceous or non-proteinaceous molecule may act either directly or indirectly to 
10 modulate the expression of ELF5 or the activity of ELF5. Said molecule acts directly if it 
associates with ELF5 or ELF5 to modulate the expression or activity of ELF5 or ELF5. 
Said molecule acts indirectly if it associates with a molecule other than ELF5 or ELF5 which 
other molecule either directly or indirectly modulates the expression or activity of ELF5 or 
ELF5. Accordingly, the method of the present invention encompasses the regulation of 
15 ELF5 or ELF5 expression or activity via the induction of a cascade of regulatory steps 
which lead to the regulation of ELF5 or ELF5 expression or activity. 

Another aspect of the present invention contemplates a method of modulating cellular 
functional activity in a mammal said method comprising administering to said manunal an 
20 effective amount of an agent for a time and under conditions sufficient to modulate the 

expression of a nucleotide sequence encoding ELF5 or sufficient to modulate the activity of 
ELF5. 

Yet another aspect of the present invention contemplates a method of modulating cellular 
25 functional activity in a mammal said method comprising administering to said mammal an 
effective amount of ELF5 or ELF5. 

The ELF5, ELF5 or agent used may also be linked to a targeting means such as a 
monoclonal antibody, which provides specific delivery of the ELF5, ELF5 or agent to the 
30 target cells. 
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In a preferred embodiment of the present invention, the ELF5, ELF5 or agent used in the 
method is linked to an antibody specific for said target cells to enable specific delivery to 
these cells. 

5 Administration of the ELF5, ELF5 or agent, in the form of a pharmaceutical 

composition, may be performed by any convenient means. ELF5, ELF5 or agent of the 
pharmaceutical composition are contemplated to exhibit therapeutic activity when 
administered in an amount which depends on the particular case. The variation depends, 
for example, on the human or animal and the ELF5, ELF5 or agent chosen. A broad 

10 range of doses may be applicable. Considering a patient, for example, from about 0. 1 
mg to about 1 mg of ELF5 or agent may be administered per kilogram of body weight 
per day. Dosage regimes may be adjusted to provide the optimum therapeutic response. 
For example, several divided doses may be administered daily, weekly, monthly or other 
suitable time intervals or the dose may be proportionally reduced as mdicated by the 

15 exigencies of the situation. The ELF5 or agent may be administered in a convenient 
manner such as by the oral, intravenous (where water soluble), intranasal, 
intraperitoneal, intramuscular, subcutaneous, intradermal or suppository routes or 
implanting (e.g. using slow release molecules). With particular reference to use of ELF5 
or agent, these peptides may be administered in the form of pharmaceutically acceptable 

20 nontoxic salts, such as acid addition salts or metal complexes, e.g. with zinc, iron or the 
like (which are considered as salts for purposes of this application). Illustrative of such 
acid addition salts are hydrochloride, hydrobromide, sulphate, phosphate, maleate, 
acetate, citrate, benzoate, succinate, malate, ascorbate, tartrate and the like. If the active 
ingredient is to be administered in tablet form, the tablet may contain a binder such as 

25 tragacanth, com starch or gelatin; a disintegrating agent, such as alginic acid; and a 
lubricant, such as magnesium stearate. 

A further aspect of the present invention relates to the use of the invention in relation to 
mammalian disease conditions. For example, the present invention is particularly useful, 
30 but in no way limited to, use in cancer therapy. 
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Accordingly, another aspect of the present invention relates to a method of treating a 
mammal said method comprising administering to said mammal an eflfective amount of an 
agent for a time and under conditions suflBcient to modulate the expression of ELF5 or 
sufficient to modulate the activity of ELF5 wherein said modulation results in modulation of 
5 cellular functional activity. 

In another aspect the present invention relates to a method of treating a mammal said 
method comprising administering to said mammal an effective amount of ELF5 or ELF5 for 
a time and under conditions suflBcient to modulate cellular functional activity. 

10 

Yet another aspect of the present invention relates to the use of an agent capable of 
modulating the expression of ELF5 or modulating the activity of ELF5 in the manufacture 
of a medicament for the modulation of cellular functional activity. 

15 A further aspect of the present invention relates to the use of ELF5 or ELF5 in the 
manufacture of a medicament for the modulation of cellular functional activity. 

Still yet another aspect of the present invention relates to agents for use in modulating ELF5 
expression or ELF5 activity wherein said modulation results in modulation of cellular 
20 functional activity. 

Another aspect of the present invention relates to ELF5 or ELF5 for use in modulating 
cellular functional activity. 

25 In a related aspect of the present invention, the mammal undergoing treatment may be 
human or an animal in need of therapeutic or prophylactic treatment. 

In yet another further aspect the present invention contemplates a pharmaceutical 
composition comprising ELF5, ELF5 or an agent capable of modulating ELF5 expression or 
30 ELF5 activity together with one or more pharmaceutically acceptable carriers and/or 
diluents. ELF5^ ELF5 or said agent are referred to as the active ingredients. 
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The pharmaceutical forms suitable for injectable use include sterile aqueous solutions 
(where water soluble) and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersion. In all cases the form must be sterile and must be fluid 
to the extent that easy syringability exists. It must be stable under the conditions of 
5 manufacture and storage and must be preserved against the contaminating action of 
microorganisms such as bacteria and fungi. The carrier can be a solvent or dispersion 
medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol and liquid polyethylene glycol, and the like), suitable mixtures thereof, 
and vegetable oils. The proper fluidity can be maintained, for example, by the use of a 

10 coating such as licithin, by the maintenance of the required particle size in the case of 
dispersion and by the use of superfactants. The preventions of the action of 
microorganisms can be brought about by various antibacterial and antifungal agents, for 
example, parabens, chlorobutanol, phenol, sorbic acid, thirmerosal and the like. In 
many cases, it will be preferable to include isotonic agents, for example, sugars or 

15 sodium chloride. Prolonged absorption of the injectable compositions can be brought 
about by the use in the compositions of agents delaying absorption, for example, 
aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the active compounds in the 
20 required amount in the appropriate solvent with various of the other ingredients 

enumerated above, as required, followed by filtered sterilization. Generally, dispersions 
are prepared by incorporating the various sterilized active ingredient into a sterile vehicle 
which contains the basic dispersion medium and the required other ingredients from those 
enumerated above. In the case of sterile powders for the preparation of sterile injectable 
25 solutions, the preferred methods of preparation are vacuum drying and the freeze-drying 
technique which yield a powder of the active ingredient plus any additional desired 
ingredient from previously sterile-filtered solution thereof. 



30 



When ELF5, ELF5 and ELF5 modulators are suitably protected they may be orally 
administered, for example, with an inert diluent or with an assimilable edible carrier, or 
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it may be enclosed in hard or soft shell gelatin capsule, or it may be compressed into 
tablets, or it may be incorporated directly with the food of the diet. For oral therapeutic 
administration, the active compound may be incorporated with excipients and used in the 
form of ingestible tablets, buccal tablets, troches, capsules, elixirs, suspensions, symps, 
5 wafers, and the like. Such compositions and preparations should contain at least 1 % by 
weight of active compound. The percentage of the compositions and preparations may, 
of course, be varied and may conveniently be between about 5 to about 80% of the 
weight of the unit. The amount of active compound in such therapeutically useful 
compositions in such that a suitable dosage will be obtained. Preferred compositions or 
10 preparations according to the present invention are prepared so that an oral dosage unit 
form contains between about 0. 1 fig and 2000 mg of active compound. 

The tablets, troches, pills, capsules and the like may also contain the following: A 
binder such as gum tragacanth, acacia, com starch or gelatin; excipients such as 

15 dicalcium phosphate; a disintegrating agent such as com starch, potato starch, alginic 
acid and the like; a lubricant such as magnesium stearate; and a sweetening agent such a 
sucrose, lactose or saccharin may be added or a flavouring agent such as peppermint, oil 
of wintergreen, or cherry flavouring. When the dosage xmit form is a capsule, it may 
contain, in addition to materials of the above type, a liquid carrier. Various other 

20 materials may be present as coatings or to otherwise modify the physical form of the 
dosage unit. For instance, tablets, pills, or capsules may be coated with shellac, sugar or 
both. A symp or elixir may contain the active compound, sucrose as a sweetening agent, 
methyl and propylparabens as preservatives, a dye and flavouring such as cherry or 
orange flavour. Of course, any material used in preparing any dosage xmit form should 

25 be pharmaceutically pure and substantially non-toxic in the amounts employed. In 
addition, the active compound may be incorporated into sustained-release preparations 
and formulations. 

Pharmaceutically acceptable carriers and/or diluents include any and all solvents, 
30 dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption 
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delaying agents and the like. The use of such media and agents for pharmaceutical active 
substances is well known in the art. Except insofar as any conventional media or agent is 
incompatible with the active ingredient, use thereof in the therapeutic compositions is 
contemplated. Supplementary active ingredients can also be incoiporated into the 
5 compositions. 

It is especially advantageous to formulate parenteral compositions in dosage unit form for 
ease of administration and uniformity of dosage. Dosage vmit form as used herein refers 
to physically discrete units suited as unitary dosages for the mammalian subjects to be 

10 treated; each unit containing a predetermined quantity of active material calculated to 
produce the desired therapeutic effect in association with the required pharmaceutical 
carrier. The specification for the novel dosage unit forms of the invention are dictated by 
and directly dependent on (a) the unique characteristics of the active material and the 
particular therapeutic effect to be achieved, and (b) the limitations inherent in the art of 

1 5 compounding such an active material for the treatment of disease in living subjects 
having a diseased condition in which bodily health is impaired as herein disclosed in 
detail. 

The principal active ingredient is compounded for convenient and effective administration 
20 in effective amounts with a suitable pharmaceutically acceptable carrier in dosage unit 
form as hereinbefore disclosed. A unit dosage form can, for example, contain the 
principal active compound in amounts ranging from 0.5 /xg to about 2000 mg. Expressed 
in proportions, the active compound is generally present in from about 0.5 /zg to about 
2000 mg/ml of carrier. In the case of compositions containing supplementary active 
25 ingredients, the dosages are determined by reference to the usual dose and manner of 
adniinistration of the said ingredients. 

The pharmaceutical composition may also comprise genetic molecules such as a vector 
capable of transfecting target cells where the vector carries a nucleic acid molecule 
30 capable of modulating ELF5 e?q)ression or ELF5 activity. The vector may, for example, 
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be a viral vector. 

Still another aspect of the present invention is directed to antibodies to ELF5 including 
catalytic antibodies. Such antibodies may be monoclonal or polyclonal and may be 
5 selected from naturally occurring antibodies to ELF5 or may be specifically raised to 
ELF5. In the case of the latter, ELF5 may first need to be associated with a carrier 
molecule. The antibodies and/or recombinant ELF5 of the present invention are 
particularly useful as therapeutic or diagnostic agents. Alternatively, fragments of 
antibodies may be used such as Fab fragments. Furthermore, the present invention 
10 extends to recombinant and synthetic antibodies and to antibody hybrids. A "synthetic 
antibody" is considered herein to include fragments and hybrids of antibodies. The 
antibodies of this aspect of the present invention are particularly useful for 
immunother^y and may also be used as a diagnostic tool for assessing apoptosis or 
monitoring the program of a therspeutic regime. 

15 

For example, ELF5 can be used to screen for naturally occurring antibodies to ELF5. These 
may occur, for example in some degenerative disorders. 

For example, specific antibodies can be used to screen for ELF5 proteins. The latter would 
20 be important, for example, as a means for screening for levels of ELF5 in a cell extract or 
other biological fluid or purifying ELF5 made by recombinant means fi-om culture 
supernatant fluid. Techniques for the assays contemplated herein are known in the art and 
include, for example, sandwich assays, ELIS A and flow cytometry. 

25 It is within the scope of this invention to include any second antibodies (monoclonal, 

polyclonal or fi-agments of antibodies) directed to the first mentioned antibodies discussed 
above. Both the first and second antibodies may be used in detection assays or a first 
antibody may be used with a commercially available anti-immunoglobulin antibody. An 
antibody as contemplated herein includes any antibody specific to any region of ELF5. 

30 

Both polyclonal and monoclonal antibodies are obtainable by immunization with the protein 
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or peptide derivatives and either type is utilizable for immunoassays. The methods of 
obtaining both types of sera are well known in the art. Polyclonal sera are less preferred but 
are relatively easily prepared by injection of a suitable laboratory animal with an effective 
amount of ELF5, or antigenic parts thereof, collecting serum from the animal, and isolating 
5 specific sera by any of the known immunoadsorbent techniques. Although antibodies 
produced by this method are utilizable in virtually any type of immunoassay, they are 
generally less favoured because of the potential heterogeneity of the product: 

The use of monoclonal antibodies in an immunoassay is particularly preferred because of the 
10 ability to produce them in large quantities and the homogeneity of the product. The 

preparation of hybridoma cell lines for monoclonal antibody production derived by fusing an 
immortal cell line and lymphocytes sensitized against the immunogenic preparation can be 
done by'techniques which are well known to those who are skilled in the art. (See, for 
example Douillard and Hoffman, Basic Facts about Hybridomas, in Compendium of 
15 Immunology Vol 11, ed. by Schwartz, 1981; Kohler and Milstein, Nature 256: 495-499, 
191 5\ European Journal of Immunology 6: 511-519, 1976). 

In another aspect of the present invention, the molecules of the present invention are also 
useful as screening targets for use in applications such as the diagnosis of disorders which 
20 are regulated by ELF5. For example, screening for the levels of ELF5 protein or ELF5 
mRNA transcripts in breast or prostate tissue as an indicator of a predisposition to, or the 
development of, breast or prostate cancer. 

Yet another aspect of the present invention contemplates a method for detecting ELF5 or 
25 ELF5 mRNA in a biological sample from a subject said method comprising contacting said 
biological sample with an antibody specific for ELF5 or ELF5 mRNA or its derivatives or 
homologs for a time and under conditions sufficient for an antibody-ELF5 or antibody-ELF5 
mRNA complex to form, and then detecting said complex. 

30 The presence of ELF5 may be determined in a number of ways such as by Western blotting, 
ELISA or flow cytometry procedures. ELF5 mRNA may be detected, for example, by in 
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situ hybridization or Northern blotting. These, of course, include both single-site and two- 
site or "sandwich" assays of the non-competitive types, as well as in the traditional 
competitive binding assays. These assays also include direct binding of a labelled antibody 
to a target. 

5 

Sandwich assays are among the most useful and commonly used assays and are favoured for 
use in the present invention. A number of variations of the sandv^ch assay technique exist, 
and all are intended to be encompassed by the present invention. Briefly, in a typical 
forward assay, an unlabelled antibody is immobilized on a solid substrate and the sample to 

10 be tested brought into contact with the bound molecule. After a suitable period of 
incubation, for a period of time sufficient to allow formation of an antibody-antigen 
complex, a second antibody specific to the antigen, labelled with a reporter molecule capable 
of producing a detectable signal is then added and incubated, allowing time sufficient for the 
formation of another complex of antibody-antigen-labelled antibody. Any unreacted 

15 material is washed away, and the presence of the antigen is determined by observation of a 
signal produced by the reporter molecule. The results may either be qualitative, by simple 
observation of the visible signal, or may be quantitated by comparing with a control sample 
containing knovsm amounts of hapten. Variations on the forward assay include a 
simultaneous assay, in which both sample and labelled antibody are added simultaneously to 

20 the bound antibody. These techniques are well known to those skilled in the art, including 
any minor variations as will be readily apparent. In accordance with the present invention 
the sample is one which might contain Bim mcluding cell extract, tissue biopsy or possibly 
serum, saliva, mucosal secretions, lymph, tissue fluid and respiratory fluid. The sample is, 
therefore, generally a biological sample comprising biological fluid but also extends to 

25 fermentation fluid and supernatant fluid such as fi"om a cell culture. 

In the typical forward sandvwch assay, a first antibody havmg specificity for the ELF5 or 
antigenic parts thereof, is either covalently or passively bound to a solid surface. The solid 
surface is typically glass or a polymer, the most commonly used polymers being cellulose, 
30 polyacrylamide, nylon, polystyrene, polyvinyl chloride or polypropylene. The solid supports 
may be in the form of tubes, beads, discs of microplates, or any other surface suitable for 
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conducting an immunoassay. The binding processes are well-known in the art and generally 
consist of cross-linking covalently binding or physically adsorbing, the polymer-antibody 
complex is washed in preparation for the test sample. An aliquot of the sample to be tested 
is then added to the solid phase complex and incubated for a period of time suflBcient (e.g. 
5 2-40 minutes) and under suitable conditions (e.g. 25 °C) to allow binding of any subunit 
present in the antibody. Following the incubation period, the antibody subunit solid phase is 
washed and dried and incubated with a second antibody specific for a portion of the hapten. 
The second antibody is linked to a reporter molecule which is used to indicate the binding of 
the second antibody to the hapten. 

10 

An alternative method involves immobilizing the target molecules in the biological sample 
and then exposing the immobilized target to specific antibody which may or may not be 
labelled with a reporter molecule. Depending on the amount of target and the strength of 
the reporter molecule signal, a bound target may be detectable by direct labelling v^th the 
15 antibody. Alternatively, a second labelled antibody, specific to the first antibody is exposed 
to the target-first antibody complex to form a target-first antibody-second antibody tertiary 
complex. The complex is detected by the signal emitted by the reporter molecule. 

By "reporter molecule" as used in the present specification, is meant a molecule which, by 
20 its chemical nature, provides an analytically identifiable signal which allows the detection of 
antigen-bound antibody. Detection may be either qualitative or quantitative. The most 
commonly used reporter molecules in this t5^e of assay are either enzymes, fluorophores or 
radionuclide containing molecules (i.e. radioisotopes) and chemiluminescent molecules. 

25 In the case of an enzyme immunoassay, an enzyme is conjugated to the second antibody, 
generally by means of glutaraldehyde or periodate. As will be readily recognized, however, 
a wide variety of diflferent conjugation techniques exist, which are readily available to the 
skilled artisan. Commonly used enzymes include horseradish peroxidase, glucose oxidase, 
beta-galactosidase and alkaline phosphatase, amongst others. The substrates to be used 

30 with the specific enzymes are generally chosen for the production, upon hydrolysis by the 
corresponding enzyme, of a detectable color change. Examples of suitable enzymes include 
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alkaline phosphatase and peroxidase. It is also possible to employ fluorogenic substrates, 
which yield a fluorescent product rather than the chromogenic substrates noted above. In all 
cases, the enzyme-labelled antibody is added to the first antibody hapten complex, allowed 
to bind, and then the excess reagent is washed away. A solution containing the appropriate 
5 substrate is then added to the complex of antibody-antigen-antibody. The substrate will 
react with the enzyme linked to the second antibody, giving a qualitative visual signal, which 
may be fiirther quantitated, usually spectrophotometrically, to give an indication of the 
amount of hapten wliich was present in the sample. "Reporter molecule" also extends to use 
of cell agglutination or inhibition of agglutination such as red blood cells on latex beads, and 
10 the like. 

Alternately, fluorescent compounds, such as fluorecein and rhodamine, may be chemically 
coupled"to antibodies without altering their binding capacity. When activated by 
illumination with light of a particular wavelength, the fluorochrome-labelled antibody 

15 adsorbs the Ught energy, inducing a state to excitability in the molecule, followed by 

emission of the light at a characteristic color visually detectable with a light microscope. As 
in the EIA, the fluorescent labelled antibody is allowed to bind to the first antibody-hapten 
complex, Afl;er washing off the unbound reagent, the remaining tertiary complex is then 
exposed to the light of the appropriate wavelength the fluorescence observed indicates the 

20 presence of the hapten of interest, Immunofluorescene and EIA techniques are both very 
well established in the art and are particularly preferred for the present method. However, 
other reporter molecules, such as radioisotope, chemiluminescent or bioluminescent 
molecules, may also be employed. 

25 The present invention also contemplates genetic assays such as involving PCR analysis to 
detect ELF5 or its derivatives. 

Further features of the present invention are more fully described in the following 
examples. It is to be understood, however, that this detailed description is included 
30 solely for the purposes of exemplifying the present invention. It should not be 

imderstood in any way as a restriction on the broad description of the invention as set out 
above. 



P:\OPER\TDO\ELF5.PRV - 27/8/98 



-37- 

SUMMARY OF SEQ ID Nos. 



Sequence SEP ID NO. 

5 





nucleotide sequence of human ELF5 


1 




amino acid sequence of human ELF5 


2 




nucleotide sequence of human ELF5 short transcript 


3 




amino acid sequence of human ELF5 short transcript 


4 


10 


nucleotide sequence of murine ELFSa 


5 




nucleotide sequence of murine ELFSb 


6 




amino acid sequence of murine ELF5 


7 




oligonucleotide primer 


8 




oligonucleotide primer 


9 


15 


oligonucleotide primer 


10 




oligonucleotide primer 


11 




oligonucleotide primer 


12 




oligonucleotide primer 


13 




oligonucleotide primer 


14 


20 


oligonucleotide primer 


15 



P:\OPER\TDO\ELF5.PRV - 27/8/98 



-38- 
EXAMPLEl 

ISOLATION OF MOUSE AND HUMAN ELF5 cDNAs 

The murine ELF5 cDNA was isolated from an adult mouse lung cDNA library. 
5 Amalgamation of sequence data revealed a 1437 bp sequence with a maximum open 
reading frame (ORF) of 759 bp, predicted to encode a 253 amino acid protein of 
^proximately 31 kD (Figure la). An upstream, in-frame ston codon suggests that this 
ORF represents the full-length coding sequence of ELF5. Additional 91 bp of 5 \ and 
696 bp of 3* sequences were obtained by reverse transcriptase polymerase chain reaction 

10 (PGR) and r^id amplification of cDNA ends (RACE), using day 14 mouse placental 
RNA. Sequence analysis revealed two discrete polyadenylation signals present in the 3* 
untranslated region (UTR). The first of diese (at 1391 bp) appears to be an overlapping 
poly(A)"' recognition signal, AATTAA and ATTAAAA. The second is a consensus 
polyadenylation signal, AATAAA, at 2181 bp. These polyadenylation signals are found 

15 close to the 3* termination of die original clone and the 3* RACE product, respectively, 
suggesting that these represent poly A signals for two separate mRNA products. Thus, 
the two predicted ELF5 cDNAs are 2224 bp and 1528 bp long. Northern blot analysis, 
using the ELF5 coding sequence as a probe, confirmed the presence of two predominant 
ELF5 transcripts in placental tissue, ELF5a and ELF5b, of approximately 2.5 kb and 1.5 

20 kb respectively. Only ELF5a was identified using a 3' UTR fragment from between the 
polyadenylation signals as a probe (Figure lb), indicating that the transcripts differ in 3' 
UTR sequences. 

A human ELF5 cDNA fragment was isolated from a human lung cDNA library 
25 following screening with a cDNA probe containing the coding sequence of mouse ELF5. 
The full coding sequence of human ELF5 was then obtained by reverse transcriptase PCR 
and RACE using human placental RNA. Analysis revealed that the ELF5 sequence is 
predicted to encode a 255 residue amino acid protein. 



30 
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EXAMPLE2 

COMPARISON OF HUMAN AND MOUSE ELF5 AMINO ACID SEQUENCES 

The predicted amino acid sequences of human and mouse ELF5 are highly conserved, 
5 widi approximately 95% identity (Figure 2a). Only a single amino acid substitution was 
observed within the putative ETS domain of human and mouse ELF5, and most of the 
other differing amino acid residues in the full-length sequences are conservative 
substitutions (8/13), suggesting that the two proteins are homologs (i.e. having an 
inferred common ancestry). Interestingly, human ELF5 does, however, contain an 

10 additional two amino acid insertion compared to mouse ELF5. In addition to the ETS 
domain, other features appear to be conserved between these two sequences. These 
include a putative 'pointed' domain (Seth et aL, 1992; Lautenberger et aL, 1992) and 
several consensus casein kinase II (CKII) (Pinna, 1990), protein kinase C (PKC) 
(Kishimoto et aL, 1985; Woodget et aL, 1986) and tyrosine kinase (Patschinsky et al,, 

15 1982; Hunter, 1982; Cooper et al, 1984) phosphorylation sites. 

The ETS domain found within all members of the ETS family is responsible for 
sequence-specific DNA binding (Seth et aL, 1992; Lautenberger et aL, 1992; Wasylyk et 
al. , 1993). The putative ETS domain of human/mouse ELF5, situated at the carboxyl 

20 terminal of the protein, is similar to that of human/mouse ELF3, with amino acid identity 
being 67 % . However, this domain is only moderately similar to that of other ETS 
family members, with the highest amino acid identity being 49% to human NERF, 48% 
to Drosophila ETS4 and E74A, and 46% to human ELFl and ELKl (Figure 2b). 
Sequence identity to other family members is in the range of 44-36%. However, amino 

25 acids highly conserved amongst ETS family members (Janknecht and Nordheim, 1993) 
are well conserved in ELF5 (23/38). Some of these highly conserved residues, such as 
the three tryptophan residues in the carboxyl half of the ETS domain, have been 
demonstrated to be structurally critical for DNA bmdmg of other ETS family members 
(Wang et al , 1992; Was>'lyk et al , 1992). 



30 
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Based on ETS domain similarities, a recent phylogenetic analysis (Graves and Petersen, 
1998) has proposed the grouping of ETS factors into subfamilies, one of which is the 
ELF (E-74-like-factor) subfamily. The ELF subfamily includes Drosophila E74A, 
human ELFl and NERF. A phylogenetic tree was generated including ELF5 and 
5 recently isolated ELF3, by maximum likelihood analysis of the ETS domain (Figure 2c). 
It shows that the human and mouse ELF5 sequences group most closely with the human 
and mouse ELF3 sequences, and that both ELF3 and ELF5 are most closely related to 
Drosophila ETS4, E74A and human ELFl and NERF within the ETS family. Thus, 
Drosophila ETS4, and human/mouse ELF3 and ELF5 may also fall into the ELF 
10 subfamily of ETS factors. 

The phylogency in Figure 2c shows the unrooted relationship among 28 ETS domains. 

EXAMPLES 

1 5 HUMAN CHROMOSOMAL MAPPING OF ELF5 

Human chromosomal localization of ELF5 was performed by PGR, using gene specific 
primers and the Genebridge 4 Radiation Hybrid DNA panel (UK HGMP Resource 
Centre). With these primers, a single product of the expected size (725 bp) was 

20 amplified from total human DNA. The PGR reactions were tiien performed separately 
for each of the individual hybrids. The amplification results from the 93 hybrids were 
submitted to the Radiation Hybrid Mapping server at Whitehead Institute/MIT Center for 
Genome Research for analysis. The result demonstrated that ELF5 is localised to 
chromosome 1. The marker most tighUy linked to ELF5 was NIB1364 (D1S3023) at a 

25 distance of OcR (lod score > 3.0), and this marker is located in die region of lp36.31 
(Figure 3). This chromosomal region frequently undergoes loss of heterozygosity (LOH) 
in several types of carcinoma (Genuardi et a/., 1989; Praml et aL, 1995; Takayama et 
al, 1993; Thrash-Bingham et a/., 1996; Wada et a/., 1998). 



30 
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EXAMPLE 4 

EXPRESSION PATTERN OF ELF5 IN MOUSE TISSUES 

Poly (A) mRNA material derived from various mouse tissues were analysed by Northem 
5 blot hybridization using the murine ELF5 cDNA as a probe. A GAPDH probe was then 
used to control for RNA loading. 

Analysis of ELF5 expression in adult mouse tissues revealed that ELF5 has a restricted 
expression pattern. Expression of two ELF5 transcripts, ELF5-a (2.5 kb) and ELF5-b 

10 (1.5 kb), were observed in lung (Lu), kidney (Ki), stomach (St), ovary (Ov), tongue 
(To), bladder (Bl), and day 2 pregnant (2 Ma) and day 10 pregnant (10 Ma) mammary 
glands, but no expression was observed in liver (Li), heart (He), small intestine (Sm), 
spleen (Sp), thymus (Th), pancreas (Pa), skeletal muscle (Sk), colon (Co) or fat (2 Fa 
and 10 Fa) (Figure 4a). Fat from day 2 (2 Fa) and day 10 (10 Fa) pregnant mice was 

15 used as a control for mammary expression, since the mammary gland contains much fat 
tissue. A single transcript was observed in brain (arrow - qjproximately 2. 1 kb), but of 
a different size to either of the two ELF5 transcripts in other organs. 

The expression of ELF5 was examined in the neonatal mouse (Figure 4b) and during 
20 embryogenesis on days 19, 17 and 16 (Figure 4c), and observed a similar expression 
pattern compared to that of the adult. However, at day 16 stage of embryogenesis low 
levels of ELF5 expression were detected in brain (regular sized transcripts) and small 
intestine, in addition to the expression pattern observed in the adult. 

25 Placental expression of ELF5 displayed an interesting pattem during stages of 

embryogenesis (Figure 4d). Both transcripts were increasingly expressed from day 9.5 to 
day 13 before an overall decrease observed from day 14 to day 19, although some 
expression was observed at day 17. 

30 The two predominant ELF5 mRNA transcripts were observed in variable ratios in 
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different tissues, suggesting that poiyadenylation sites may be utilized differentially, or 
the two transcripts are subject to differential degradation. ELF5-a was expressed more 
strongly in neonatal and embryonic lung and kidney (Figures 4b and c), and adult ovary 
(Figure 4a), compared to ELF5-b. Conversely, ELF5-b was stronger in adult tongue 
5 (Figure 4a), and in all developmental stages of stomach (Figures 4a, b and c), compared 
to ELF5-a. In some RNA samples a further large ( > 10 kb) transcript was variably 
observed. 

EXAMPLES 

1 0 EXPRESSION PATTERN OF ELF5 IN HUMAN TISSUES AND CANCER CELL 

LINES 

Expression of ELF5 in adult human organs was also analysed by Northern blot of 
poly(A)'*" mRNA probed with the human ELF5 cDNA (Figure 5a). A single transcript of 
15 approximately 2.5 kb was strongly expressed in kidney (Ki) and prostate (Pr). However, 
much longer e?q)osures of blots demonstrated just detectable expression of ELF5 in 
placenta (PI) and lung (Lu). Further, ELF5 was cloned from human lung and placenta 
cDNA libraries, confirming that it is expressed in these tissues, albeit probably at very 
low levels, 

20 

ELF5 expression in human cancers was examined. A panel of cancer cell lines, 
including carcinomas of the ovary (CaOv-3), breast (BT-549, ZR-75-1, T47D), kidney 
(786-0), liver (SK-HEP-1), lung (A549), amnion (WISH), prostate (DU145, PCS) and 
endometrium (HEC-1), and melanoma (MEL28), T-cell leukemia (Jurkat) and erythroid 
25 leukemia (K562), were analysed for ELF5 expression by RNAse protection assay (Figure 
5b). A primary fibroblast cell line (CCL32SK) was also included as a sample of non- 
transformed cells. Of all these cell lines only T47D, a progesterone sensitive ductal 
breast carcinoma, was observed to express ELF5. 

30 To evaluate the possibility that lack of ELF5 e?q)ression in carcinoma was due to 
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genomic alterations, a panel of breast and lung carcinoma derived cell lines were 
analysed by Southern blot (Figure 5c). ELF5 gene dosage was compared to that present 
in DNA from normal human blood (based on the 6.5 kb Bglil fragment) and controlled 
by hybridization with a ^-actin cDNA probe. These results are summarized in the lower 
5 panel, where '2* represents a normal allele complement. No evidence was found for 
allelic loss or gene rearrangement in the two breast carcinoma cell lines that did not 
express ELF5 (BT-549 - lane 2, ZR-75-1 - lane 3). However, of nine lung carcinoma 
cell lines, evidence for loss of an ELF5 allele was observed in two (NCI-H358 - lane 8, 
NCI-H441 - lane 11). Hybridization with an ELF3 cDNA probe, which is localised to 

10 the long arm of chromosome 1 (Tymms et c/,, 1997), helped to confirm the specific loss 
of ELF5 alleles. Two other lung carcinoma lines (SK-LU-1 - lane 10, NCI-H661 - lane 
13) displayed hybridization with multiple fragments (shaded arrows) in addition to those 
observed in normal DNA (solid arrows), possibly indicating that at least one ELF5 allele 
has been rearranged in these lines. Confirmation of rearrangement, rather than 

15 restriction fragment length polymorphism (RFLP), was made by additional restriction 
digests. Some cell lines appeared to have amplification or additional copies of the ELF5 
gene. One of these, T47D (lane 4), was the only cell line demonstrated to express 
ELF5, and another, SK-LU-1 (lane 10), appeared to have rearranged alleles. 

20 EXAMPLE 6 

SEQUENCE-SPECIFIC BINDING OF ELF5 TO DNA SEQUENCES 
CONTAINING CONSENSUS ETS SITES 

Although ELF5 displays similarity to the consensus ETS domain, characterising it as an 
25 ETS family member, this sequence is still quite divergent from most other ETS family 
members. The hallmark of ETS factors to bind DNA sites containing a GGAA-core in a 
sequence-specific manner is however shared by ELF5, demonstrating an additional 
functional similarity to the ETS family. A recombinant ELF5 HIS-tag protein of 
approximately 29 kD, expressed in E, coli and purified by metal-affinity chromatography 
30 (Figure 6a, lane 4), displayed strong binding to consensus ETS binding sites, as analysed 
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by electrophoretic mobility shift assay (EMS A) (Figure 6b). ELF5 bound the E74 
oligonucleotide (containing a GGAA-core) (lane 1), but not to the E74ml oligonucleotide 
(which had been mutated to an AGAA-core) (lane 2). The first G-residue of the core has 
been demonstrated to be a physical point of DNA contact for ETSl, and consequendy 
5 essential for DNA binding (Fisher et aL, 1991; Nye et al, 1992). Thus, ELF5 displays 
sequence specific binding to a consensus ETS binding site, binding that is disrupted by a 
mutation known to similarly affect other ETS family members. These results were 
confirmed through competition analysis. The ELF5-E74 complex (lane 3) was efficiently 
competed by the addition of a 100-fold excess of unlabeled E74 (lane 4), but not by 
10 E74ml (lane 5). 

ELF5 also displayed sequence specific binding to different consensus ETS binding 
sequences, and did so with differential affinity (Figure 6b). Competition of the ELF5- 
E74 complex (lane 3) was achieved by consensus ETS sites from the GM-CSF promoter 
15 (lane 6), erb-^l promoter (lane 7) and moloney sarcoma vims (MSV) long terminal 
repeat (LTR) (lane 8). The relative ability of ELF5 to bind these sequences occurred in 
the order: E74 > erbB2 > MSV > GM-CSF. ELF5 did not ^pear to be competed at all 
by an oligonucleotide containing a consensus API binding site (lane 9). ETSl binding to 
E74 was used as a positive control (lane 10). 

20 

EXAMPLE? 

MOUSE ELF5 ACTS AS A TRANSCRIPTIONAL ACTIVATOR 

In addition to DNA binding, another characteristic of most ETS factors is their ability to 
25 transactivate from binding sites in promoters and enhancers. 

A reporter construct, containing the chloramphenicol acetyl-transferase (CAT) driven by 
a minimal TK promoter and multiple ETS/APl binding sites (from the polyomavims 
enhancer), was co-transfected into COS cells together with an ELF5 expression constmct 
30 (Figure 7). Analysis of CAT activities revealed that ELF5 expression resulted in an 
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average five-fold transactivation of the reporter. Further, this transactivation was 
inhibited by addition of an anti-sense ELF5 mRNA expression vector, indicating that 
ELF5 transactivation was due specifically to the product translated from the sense 
construct. 

5 

EXAMPLES 

ISOLATION AND CHARACTERIZATION OF FULL-LENGTH MURINE ELF5 

CDNA 

10 The murine Elf 5 cDNA was isolated from an adult lung cDNA library in Lambda ZAPII 
(Stratagene) following screening with a cDNA probe containing the ETS domain region of 
human ELF3. Additional 5' sequence and 3' sequence were obtained by RT-PCR using a 
Marathon cDNA synthesis Kit (Clontech) and RACE (Rapid Amplification using day 14 
murine of cDNA Ends) placental Poly(A)* RNA. The murine £^-specific PCR products 

15 were cloned into pGEM-T vector (Promega Corp., Madison, WI, USA). All cDNA 

sequences were confirmed by sequencing both strands at least once. 5 'RACE gene-specific 
primer 1:5'GCCAGTCTTG-GTCTCTTCAGCATC.3' (SEQ ID NO:8); 5'-RACE nested- 
gene-specific primer 2: 5 '-AGGAGATGCAGTTGGCATCAAGCT-3 ' (SEQ ID NO:9); 3'- 
RACE gene-specific primer 1: 5 '-AGCCAGTGTTATGGGTGCTG-3 ' (SEQ ID NO: 10); 

20 3'-RACE nested-gene-specific primer 2: 5 '-ACAGTCACTTGATCCACGGCCAATCC-3 ' 
(SEQ ID NO: H). 

EXAMPLE 9 

ISOLATION OF HUMAN ELF5 CODING SEQUENCE 

25 

A human ELF5 cDNa fi-agment was isolated fi-om a human lung cDNA library (GIBCO 
BRL) following screening with a cDNA probe containing the coding sequence of mouse 
Elf 5. The coding sequence was then obtained by RT-PCR using a Marathon cDNA 
synthesis Kit (Clontech) and RACE (Rapid Amplification of cDNA Ends) using human 
30 placental Poly(A)* RNA. The human £LF5-specific PCR products were cloned into 
pGEM-T vector (Promega Corp., Madison, WI, USA). All cDNA sequences were 
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confirmed by sequencing both strands at least once. 

EXAMPLE 10 
STS CONTENT MAPPING 

5 

The following sequence specific primers for human ELF5 were used for PGR. Forward 
primer: 5'-GGGTGGCAGGAAGACAAGCTATGA-3 (SEQ ID NO: 12); Reverse primer: 
5'-CCAATTAAGTCCCAGCTTGATGGC-3 (SEQ ID NO: 13). The PGR reactions were 
performed in Opti-Primer™ 10 x buffer #3 (100 mM Tris-HGl pH 8.3, 35 mM MgGl2, 250 

10 mM Kcl) with 1 lA of Master Mix 50 x buffer (20 mM Tris-HGl pH 8.0, 250 nM EDTA) 
(Opti-Primer™ PGR Optimization Kit, Stratagene), 50 ng of template DNA, 0.2 //g of each 
primer, 1 //I of 10 mM dNTPs and 0.25 U of Taq DNA polymerase in a total volume of 50 
fA, PGR parameters were an initial denaturation step at 94°G for 1 min, followed by 30 
cycles of 94*^0 (1 min), 65 °G (1 min), 72°G (1 min). For Genebridge 4 Radiation Hybrid 

15 DNA panel (UK GHMP Resource Gentre), PGR reactions were performed separately for 
each of the individual hybrids. The PGR resuUs fi-om the 93 hybrids were submitted to the 
Radiation Hybrid Mapping server at Whitehead Institute/MIT Genter for Grenome Research 
(http://www.genome.wi.mit.edu/cgibin/contig/rhmapper.pl). The STS content mapping 
experiment was performed in duplicate and included PGR reactions with no DNA, total 

20 human DNA and total hamster DNA as controls. 

EXAMPLE 11 
SOUTHERN AND NORTHERN BLOT ANALYSIS 

25 Northern analysis of ELF5 expression in human adult organs was performed with 
commercially available blots containing 2 /zg of Poly(A)"' RNA (Glontech). For other 
Northern blots POLY(A)'' mRNA was isolated by a modification of Gonda et al (1992). 
Genomic DNA was isolated by standard techniques (Sambrook et al, 1997), Random- 
primed probes usmg a 898 bp human ELF5 cDNA fi-agment and a 940 bp Styl mouse Elf5 

30 cDNA firagment were generated and Southern/Northern hybridizations performed using 
standard procedures. Blots were re-probed with glyceraldehyde-3-phosphate 
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dehydrogenase (GAPDH) or ^-actin cDNAs to verify RNA/DNA loading. 

EXAMPLE 12 
RNASE PROTECTION ANALYSIS 

5 

ELF5 mRNA abundance in total RNA from human cell lines was determined as described 
previously (Tymms, 1995). Anti-sense RNA probes for human ELF5 and GAPDH 
transcribed from linearized plasmid vectors generated full-length probes of 388 bp and 216 
bp, respectively. The protected products generated by hybridization and RNAse digestion 
10 are 298 bp for ELF5 and 150 bp for GAPDH. 

EXAMPLE 13 
CELL LINES AND CULTURE 

15 Monkey COS7 cells were grown in Dulbecco's modified Eagle's medium (DMEM) 

supplemented with 10% fetal calf serum, 100 U/ml penicillin and 100 //g/mi streptomycin, 
and maintained in a humidified incubator at 5% CO2 and 37** C. 

EXAMPLE 14 
20 PLASMIDS 

pHis6-Elf5 expression vector was made as follows: The murine Elf 5 cDNA was amplified 
using PGR oligonucleotide primers (5'-CGGGATCCTTGGACTCCGTAACCCATAGC-3' 
(SEQ ID NO: 14) and 5'-GCAGATCTCAGAGTTTCTCTTCCTGCC-3' (SEQ ID NO: 15)) 

25 containing a BaniHl restriction site followed by 21 nucleotides of the murine Elf 5 coding 
sequence and SiBgin restriction site followed by 19 nucleotides complementary to the last 
20 nucleotides of the Elf5 coding sequence. The PGR fragment was cloned into the pGEM- 
T vector (Promega Corp., Madison, WI, USA), the BamHl-Sacl restriction fragment with 
the Elf 5 coding sequence was then cloned into the BamHl-Sacl sites of the pQE30 (Qiagen, 

30 Inc. Chatsworth, CA, USA) bacterial expression vector resulting in a N-terminal fiision of 
Elf5 protein to six histidine residues (His-Tag). 
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The Elf5 mammalian expression construct (pBOSElfSs) contains the fiill mouse Elf5 cDNA 
blunt cloned into the T4 polymerase blunted Jt^al site of pEFBOS (Mizushima and Natata, 
1990). Expression from pEFBOS is driven by the elongation factor- 1 promoer. The Elf 5 
anti-sense expression construct is similar, but with reverse orientation of the Elf 5 
5 polyomavirus enhancer oligonucleotides into the BarriHl site of pBLCAT2. 

EXAMPLE 15 

ELECTROPHORETIC MOBILITY SHIFT ASSAYS (EMSA) 

10 Purified recombinant Elf 5 and Etsl proteins were produced as 6XHis-tag fusions in E, coli 
using the QIAexpress expression system (Qiagen), Overnight cultures were diluted 1/10 in 
LB broth and grown for 1 h at 37°C. Expression of recombinant proteins were induced by 
addition of 0. 1 mM IPTG and culture of cells for 2 h. Cells were harvested and sonicated in 
lysis buffer (6 M guanidine, 20 mM Tris-HCl, 50 mM NaCl, pH 8.0), and cell debris 

1 5 removed by centrifugation. One ml of metal His-aflfinity resin was incubated with 

supematants for 30 min, collected, washed in wash buflfer (8 M urea, 20 mM Tris-HCl, 50 
mM NaCl, pH 8.0), and resuspended in renaturation buffer (20 mM Tris-HCl, 50 mM NaCl, 
3 mM dithiothreitol (DTT), pH 8.0), Proteins were eluted from the beads in renaturation 
buffer supplemented with 100 mM imidazole. Purification and integrity of recombinant 

20 proteins were confirmed by denaturing SDS-polyacrylamide gel electrophoresis (PAGE), 

DNA binding experiments with recombinant proteins were performed using EMSA, as 
previously described (Thomas et al, 1995, 1997). Briefly, purified double stranded 
oUgonucleotides were labeled with y-^^P dATP and T4 polynucleotide kinase. 

25 Oligonucleotide probe (1 ng) was incubated for 10 min with approximately 20 ng purified 
ElfS/Etsl protein m DNA binding buffer ( 1 mM EDTA, 10 mM Tris-Hcl pH 8.0, 50 mM 
NaCl, 3 mM DTT, 1 mg/ml BSA, 500 ng/ml poly-d(I-C)d(I-C), 500 ng/ml poly dl-dC, 200 
ng/ml sheared sahnon sperm DNA), ± 100 ng unlabelled competitor oligonucleotides, m 10 
lA final volume. Assays were run through non-denaturing, 7% acrylamide (29 acrylamide:! 

30 bis-acrylamide), 0.5xTBEgelsat4°C. 



/ 

/ 
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EXAMPLE16 

LACK OF ELFS EXPRESSION IN HUMAN PRIMARY BREAST CARCINOMAS 

A panel of human primary breast carcinoma samples were analysed for ELF5 expression 
5 by in situ hybridization (Hogan et aL, 1994). Section from parafm-embedded samples 
were hybridized with ELF5 ^^P-labelled antisense RNA, and signals were detected with a 
photosensitive emulsion. Serial sections were also stained with eosin and haematoxylin. 

A preliminary examination of ELFS expression shows that ELFS is not detectable in 20 
10 out of 20 human breast carcinomas studied, whereas it is strongly expressed in adjacent 
normal epithelium and in epithelial cells from normal subjects (Figure 8). 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1676 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/ICEY: CDS 

(B) LOCATION: 110.. 875 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

CAAGGCTACA GGTGTCTTTA TTTCCACTGC ACGCTGGTGC TGGGAGCGCC TGCCTTCTCT 

TGCCTTGAAA GCCTCCTCTT TGGACCTAGC CACCGCTGCC CTCACGGTA ATG TTG 

Met Leu 
1 

GAC TCG GTG ACA CAC AGC ACC TTC CTG CCT AAT GCA TCC TTC TGC GAT 
Asp Ser Val Thr His Ser Thr Phe Leu Pro Asn Ala Ser Phe Cys Asp 
5 10 15 

CCC CTG ATG TCG TGG ACT GAT CTG TTC AGC AAT GAA GAG TAC TAC CCT 
Pro Leu Met Ser Trp Thr Asp Leu Phe Ser Asn Glu Glu Tyr Tyr Pro 
20 25 30 

GCC TTT GAG CAT CAG ACA GCC TGT GAC TCA TAC TGG ACA TCA GTC CAC 
Ala Phe Glu His Gin Thr Ala Cys Asp Ser Tyr Trp Thr Ser Val His 
35 40 45 50 

CCT GAA TAC TGG ACT AAG CGC CAT GTG TGG GAG TGG CTC CAG TTC TGC 
Pro Glu Tyr Trp Thr Lys Arg His Val Trp Glu Trp Leu Gin Phe Cys 
55 60 65 

TGC GAC CAG TAC AAG TTG GAC ACC AAT TGC ATC TCC TTC TGC AAC TTC 
Cys Asp Gin Tyr Lys Leu Asp Thr Asn Cys lie Ser Phe Cys Asn Phe 
70 75 80 

AAC ATC AGT GGC CTG CAG CTG TGC AGC ATG ACA CAG GAG GAG TTT GTC 
Asn lie Ser Gly Leu Gin Leu Cys Ser Met Thr Gin Glu Glu Phe Val 
85 90 95 

GAG GCA GCT GGC TTC TGC GGC GAG TAC CTG TAC TTC ATC CTC CAG AAC 
Glu Ala Ala Gly Phe Cys Gly Glu Tyr Leu Tyr Phe lie Leu Gin Asn 
100 105 110 

ATC CGC ACA CAA GGT TAC TCC TTT TTT AAT GAC GCT GAA GAA AGC AAG 
lie Arg Thr Gin Gly Tyr Ser Phe Phe Asn Asp Ala Glu Glu Ser Lys 
115 120 125 130 

GCC ACC ATC AAA GAC TAT GCT GAT TCC AAC TGC TTG AAA ACA AGT GGC 
Ala Thr lie Lys Asp Tyr Ala Asp Ser Asn Cys Leu Lys Thr Ser Gly 
135 140 145 

ATC AAA AGT CAA GAC TGT CAC AGT CAT AGT AGA ACA AGC CTC CAA AGT 
lie Lys Ser Gin Asp Cys His Ser His Ser Arg Thr Ser Leu Gin Ser 
150 155 160 



TCT CAT CTA TGG GAA TTT GTA CGA GAC CTG CTT CTA TCT CCT GAA GAA 
Ser His Leu Trp Glu Phe Val Arg Asp Leu Leu Leu Ser Pro Glu Glu 
165 170 175 
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AAC TGT GGC ATT CTG GAA TGG GAA GAT AGO GAA CAA GGA ATT TTT CGG 691 
Asn Cys Gly lie Leu Glu Trp Glu Asp Arg Glu Gin Gly lie Phe Arg 
180 185 190 

GTG GTT AAA TCG GAA GCC CTG GCA AAG ATG TGG GGA CAA AGG AAG AAA 73 9 

Val Val Lys Ser Glu Ala Leu Ala Lys Met Trp Gly Gin Arg Lys Lys 
195 200 205 210 

AAT GAC AGA ATG ACA TAT GAA AAG TTG AGC AGA GCC CTG AGA TAC TAC 787 
Asn Asp Arg Met Thr Tyr Glu Lys Leu Ser Arg Ala Leu Arg Tyr Tyr 
215 220 225 

TAT AAA ACA GGA ATT TTG GAG CGG GTT GAC CGA AGG TTA GTG TAC AAA 835 
Tyr Lys Thr Gly lie Leu Glu Arg Val Asp Arg Arg Leu Val Tyr Lys 
230 235 240 



TTT GGA AAA AAT GCA CAC GGG TGG CAG GAA GAC AAG CTA T GATCTGCTCC 885 
Phe Gly Lys Asn Ala His Gly Trp Gin Glu Asp Lys Leu 
245 250 255 



AGGCATCAAG 


CTCATTTTAT 


GGATTTCTGT 


CTTTTAAAAC 


AATCAGATTG 


CAATAGACAT 


945 


TCGAAAGGCT 


TCATTTTCTT 




TTAACCTGCA 


AACATGCTGA 


TAAAATTTCT 


1005 


CCACATCTCA 


GCTTACATTT 


GGATTCAGAG 


TTGTTGTCTA 


CGGAGGGTGA 


GAGCAGAAAC 


1065 


TCTTAAGAAA 


TCCTTTCTTC 


TCCCTAAGGG 


GATGAGGGGA 


TGATCTTTTG 


TGGTGTCTTG 


1125 


ATCAAACTTT 


ATTTTCCTAG 


AGTTGTGGAA 


TGACAACAGC 


CCATGCCATT 


GATGCTGATC 


1185 


AGAGAAAAAC 


TATTCAATTC 


TGCCATTAGA 


GACACATCCA 


ATGCTCCCAT 


CCCAAAGGTT 


1245 


CAAAAGTTTT 


CAAATAACTG 


TGGCAGCTCA 


CCAAAGGTGG 


GGGAAAGCAT 


GATTAGTTTG 


1305 


CAGGTTATGG 


TAGGAGAGGG 


TGAGATATAA 


GACATACATA 


CTTTAGATTT 


TAAATTATTA 


1365 


AAGTCAAAAA 


TCCATAGAAA 


AGTATCCCTT 


Trri"ri''i''i''i'T 


TGAGACGGGT 


TCTCACTATG 


1425 


TTGCCCAGGG 


CTGGTCTTGA 


ACTCCTATGC 


TCAAGTGATC 


CTCCCACCTC 


GGCCTCCCAA 


1485 


AGTACTGTGA 


TTACAAGCGT 


GAGCCACGGC 


ACCTGGGCAG 


AAAAGTATCT 


TAATTAATGA 


1545 


AAGAGCTAAG 


CCATCAAGCT 


GGGACTTAAT 


TGGATTTAAC 


ATAGGTTCAC 


AGAAAGTTTC 


1605 


CTAACCAGAG 


CATCTTTTTG 


ACCACTCAGC 


AAAACTTCCA 


CAGACATCCT 


TCTGGACTTA 


1665 


AACCGGAATT 


C 










1676 



(2) INFORMATION FOR SEQ ID NO: 2; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Asp Ser Val Thr His Ser Thr Phe Leu Pro Asn Ala Ser Phe 
15 10 15 

Cys Asp Pro Leu Met Ser Trp Thr Asp Leu Phe Ser Asn Glu Glu Tyr 
20 25 30 
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Tyr Pro Ala Phe Glu His Gin Thr Ala Cys Asp Ser Tyr Trp Thr Ser 
35 40 45 

Val His Pro Glu Tyr Trp Thr Lys Arg His Val Trp Glu Trp Leu Gin 
50 55 60 

Phe Cys Cys Asp Gin Tyr Lys Leu Asp Thr Asn Cys lie Ser Phe Cys 
65 70 75 80 

Asn Phe Asn lie Ser Gly Leu Gin Leu Cys Ser Met Thr Gin Glu Glu 
85 90 95 

Phe Val Glu Ala Ala Gly Phe Cys Gly Glu Tyr Leu Tyr Phe lie Leu 
100 105 110 

Gin Asn lie Arg Thr Gin Gly Tyr Ser Phe Phe Asn Asp Ala Glu Glu 
115 120 125 

Ser Lys Ala Thr lie Lys Asp Tyr Ala Asp Ser Asn Cys Leu Lys Thr 
130 135 140 

Ser Gly lie Lys Ser Gin Asp Cys His Ser His Ser Arg Thr Ser Leu 
145 150 155 160 

Gin Ser Ser His Leu Trp Glu Phe Val Arg Asp Leu Leu Leu Ser Pro 
165 170 175 

Glu Glu Asn Cys Gly lie Leu Glu Trp Glu Asp Arg Glu Gin Gly He 
180 185 190 

Phe Arg Val Val Lys Ser Glu Ala Leu Ala Lys Met Trp Gly Gin Arg 
195 200 205 

Lys Lys Asn Asp Arg Met Thr Tyr Glu Lys Leu Ser Arg Ala Leu Arg 
210 215 220 

Tyr Tyr Tyr Lys Thr Gly He Leu Glu Arg Val Asp Arg Arg Leu Val 
225 230 235 240 

Tyr Lys Phe Gly Lys Asn Ala His Gly Trp Gin Glu Asp Lys Leu 
245 250 255 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 134.. 614 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TGTCTGTAGG TGTCACTTAT ATCACAAGGC TACAGGTGTC TTTATTTCCA CTGCACGCTG 
GTGCTGGGAG CGCCTGCCTT CTCTTGCCTT GAAAGCCTCC TCTTTCGACC TAGCCACCGC 
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TGCCCTCACG GTA ATG TTG GAG TCG GTG ACA CAC AGC ACC TTC CTG COT 169 
Met Leu Asp Ser Val Thr His Ser Thr Phe Leu Pro 
15 10 

AAT GCA TCC CTC TGC GAT CCC CTG ATG TCG TGG ACT GAT CTG TTC AGC 217 
Asn Ala Ser Leu Cys Asp Pro Leu Met Ser Trp Thr Asp Leu Phe Ser 
15 20 25 

AAT GAA GAG TAC TAC CCT GCC TTT GAG CAT CAG ACA GAT GCT GAT TCC 265 
Asn Glu Glu Tyr Tyr Pro Ala Phe Glu His Gin Thr Asp Ala Asp Ser 
30 35 40 

AAC TGC TTG AAA ACA AGT GGC ATC AAA AGC CAA GAC TGT CAC AGT CAT 313 
Asn Cys Leu Lys Thr Ser Gly He Lys Ser Gin Asp Cys His Ser His 
45 50 55 60 

AGT AGA ACA AGC CTC CAA AGT TCT CAT CTA TGG GAA TTT GTA CGA GAC 361 
Ser Ara Thr Ser Leu Gin Ser Ser His Leu Trp Glu Phe Val Arg Asp 
65 70 75 

CTG CTT CTA TCT CCT GAA GAA AAC TGT GGC ATT CTG GAA TGG GAA GAT 409 
Leu Leu Leu Ser Pro Glu Glu Asn Cys Gly He Leu Glu Trp Glu Asp 
80 85 90 

AGG GAA CAA GGA ATT TTT CGG GTG GTT AAA TCG GAA GCC CTG GCA AAG 457 
Arg Glu Gin Gly He Phe Arg Val Val Lys Ser Glu Ala Leu Ala Lys 
95 100 105 

ATG TGG GGA CAA AGG AAG AAA AAT GAC AGA ATG ACA TAT GAA AAG TTG 505 
Met Trp Gly Gin Arg Lys Lys Asn Asp Arg Met Thr Tyr Glu Lys Leu 
110 115 120 

AGC AGA GCC CTG AGA TAC TAC TAT AAA ACA GGA ATT TTG GAG CGG GTT 553 
Ser Arg Ala Leu Arg Tyr Tyr Tyr Lys Thr Gly He Leu Glu Arg Val 
125 130 135 140 

GAC CGA AGG TTA GTG TAC AAA TTT GGA AAA AAT GCA CAC GGG TGG CAG 601 
Asp Arg Arg Leu Val Tyr Lys Phe Gly Lys Asn Ala His Gly Trp Gin 
145 150 155 

GAA GAC AAG CTA T GATCTGCTCC AGGCATCAAG CTCATTTTAT GGATTTCTGT 654 
Glu Asp Lys Leu 
160 

CTTTTAAAAC AATCAGATTG CAATAGACAT TCGAAAGGCT TCATTTTCTT CTCTTTTTTT 714 

TTAACCTGCA AACATGCTGA TAAAATTTCT CCACATCTCA GCTTACATTT GGATTCAGAG 774 

TTGTTGTCTA CGGAGGGTGA GAGCAGAAAC TCTTAAGAAA TCCTTTCTTC TCCCTAAGGG 834 

GATGAGGGGA TGATCTTTTG TGGTGTCTTG ATCAAACTTT ATTTTCCTAG AGTTGTGGAA 894 

TGACAACAGC CCATGCCATT GATGCTGATC AGAGAAAAAC TATTCAATTC TGCCATTAGA 954 

GACACATCCA ATGCTCCCAT CCCAAAGGTT CAAAAGTTTT CAAATAACTG TGGCAGCTCA 1014 

CCAAAGGTGG GGGAAAGCAT GATTAGTTTG CAGGTTATGG TAGGAGAGGG TGAGATATAA 1074 

GACATACATA CTTTAGATTT TAAATTATTA AAGTCAAAAA TCCATAGAAA AGTATCCCTT 1134 

' riTrrrrriT tgagacgggt tctcactatg ttgcccaggg ctggtcttga actcctatgc 1194 

TCAAGTGATC CTCCCACCTC GGCCTCCCAA AGTACTGTGA TTACAAGCGT GAGCCACGGC 1254 
ACCTGGGCAG AAAAGTATCT TAATTAATGA AAGAGCTAAG CCATCAAGCT GGGACTTAAT 1314 
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TGGATTTAAC ATAGGTTCAC AGAAAGTTTC CTAACCAGAG CATCTTTTTG ACCACTCAGC 13 74 
AAAACTTCCA CAGACATCCT TCTGGACTTA AACCGGAATT C 1415 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Leu Asp Ser Val Thr His Ser Thr Phe Leu Pro Asn Ala Ser Leu 
15 10 15 

Cys Asp Pro Leu Met Ser Trp Thr Asp Leu Phe Ser Asn Glu Glu Tyr 
20 25 30 

Tyr Pro Ala Phe Glu His Gin Thr Asp Ala Asp Ser Asn Cys Leu Lys 
35 40 45 

Thr Ser Gly lie Lys Ser Gin Asp Cys His Ser His Ser Arg Thr Ser 
50 55 60 

Leu Gin Ser Ser His Leu Trp Glu Phe Val Arg Asp Leu Leu Leu Ser 
65 70 75 80 

Pro Glu Glu Asn Cys Gly lie Leu Glu Trp Glu Asp Arg Glu Gin Gly 
85 90 95 

lie Phe Arg Val Val Lys Ser Glu Ala Leu Ala Lys Met Trp Gly Gin 
100 105 110 

Arg Lys Lys Asn Asp Arg Met Thr Tyr Glu Lys Leu Ser Arg Ala Leu 
115 120 125 

Arg Tyr Tyr Tyr Lys Thr Gly lie Leu Glu Arg Val Asp Arg Arg Leu 
130 135 140 

Val Tyr Lys Phe Gly Lys Asn Ala His Gly Trp Gin Glu Asp Lys Leu 
145 150 155 160 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 117.. 876 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATCACACGGC TACAGGTGCC TTTATTTCTA CAGTCCGCTG GTGCTGGGAG CGCGCTTGCC 60 
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TTCTCTTGCC TTGAAAGCCT TCTGTCTGGA CCTAGCCACC ACTTGTCTTC ACGGTG 116 

ATG TTG GAC TCC GTA ACC CAT AGC ACC TTC CTG CCC AAC GCA TCC TTC 164 
Met Leu Asp Ser Val Thr His Ser Thr Phe Leu Pro Asn Ala Ser Phe 
15 10 15 

TGT GAC CCC CTG ATG CCT TGG ACC GAT CTG TTC AGC AAT GAA GAC TAC 212 
Cys Asp Pro Leu Met Pro Trp Thr Asp Leu Phe Ser Asn Glu Asp Tyr 
20 25 30 

TAC CCT GCC TTT GAG CAT CAG ACA GCC TGT GAT TCC TAC TGG ACA TCA 260 
Tyr Pro Ala Phe Glu His Gin Thr Ala Cys Asp Ser Tyr Trp Thr Ser 
35 40 45 



GTG CAC CCT GAA TAC TGG ACC AAG CGC CAT GTC TGG GAA TGG CTC CAA 
Val His Pro Glu Tyr Trp Thr Lys Arg His Val Trp Glu Trp Leu Gin 
50 55 60 



308 



TTC TGC TGT GAC CAG TAC AAG CTT GAT GCC AAC TGC ATC TCC TTC TGT 3 56 

Phe Cys Cys Asp Gin Tyr Lys Leu Asp Ala Asn Cys lie Ser Phe Cys 
65 70 75 .80 

CAC TTC AAC ATC AGC GGC CTG CAG CTC TGC AGC ATG ACG CAG GAG GAG 404 
His Phe Asn He Ser Gly Leu Gin Leu Cys Ser Met Thr Gin Glu Glu 
85 90 95 

TTC ATT GAG GCA GCC GGC ATC TGT GGG GAG TAC CTG TAC TTC ATT CTC 4 52 

Phe He Glu Ala Ala Gly He Cys Gly Glu Tyr Leu Tyr Phe He Leu 
100 105 110 

CAG AAC ATT CGC TCG CAA GGT TAC TCC TTT TTC AAT GAT GCT GAA GAG 500 
Gin Asn He Arg Ser Gin Gly Tyr Ser Phe Phe Asn Asp Ala Glu Glu 
115 120 125 

ACC AAG ACT GGC ATC AAA GAC TAT GCT GAT TCC AGT TGC TTG AAA ACA 548 
Thr Lys Thr Gly He Lys Asp Tyr Ala Asp Ser Ser Cys Leu Lys Thr 
130 135 140 

AGT GGC ATC AAG AGT CAA GAC TGT CAC AGC CGA ACA AGC CTC CAA AGT 596 
Ser Gly He Lys Ser Gin Asp Cys His Ser Arg Thr Ser Leu Gin Ser 
145 150 155 160 

TCT CAC CTG TGG GAA TTT GTC AGA GAC TTG CTG CTG TCC CCT GAA GAG 644 
Ser His Leu Trp Glu Phe Val Arg Asp Leu Leu Leu Ser Pro Glu Glu 
165 170 175 

AAC TGT GGC ATC CTG GAA TGG GAA GAC AGG GAG CAG GGC ATT TTC CGA 692 
Asn Cys Gly He Leu Glu Trp Glu Asp Arg Glu Gin Gly He Phe Arg 
180 185 190 

GTG GTT AAG TCA GAA GCC CTG GCA AAG ATG TGG GGA CAA AGG AAG AAG 740 
Val Val Lys Ser Glu Ala Leu Ala Lys Met Trp Gly Gin Arg Lys Lys 
195 200 205 

AAT GAC AGG ATG ACG TAC GAG AAG CTG AGC CGA GCC CTG AGA TAC TAC 788 
Asn Asp Arg Met Thr Tyr Glu Lys Leu Ser Arg Ala Leu Arg Tyr Tyr 
210 215 220 

TAT AAA ACG AGA ATT CTG GAG CGG GTT GAC CGG AGG TTA GTG TAC AAA 836 
Tyr Lys Thr Arg He Leu Glu Arg Val Asp Arg Arg Leu Val Tyr Lys 
225 230 235 240 

TTT GGA AAG AAC GCG CAC GGG TGG CAG GAA GAG AAA CTC T GATGGACACC 886 
Phe Gly Lys Asn Ala His Gly Trp Gin Glu Glu Lys Leu 
245 250 
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GGACACCAGG CTCATTTGAT GGATTTCTGT TGTTGGAAAC AATCAGATCA AACTAGACAT 94 6 

TTGAAAGTCT CCCTCCTCCT CCTCCTCCCC CTCCTTCCCC TCCTCTTCTT CCTCCCCCTC 1006 

CTCCTCTTCA AAACCTACAA ACACACTGAT AAAATTTCTG CATGTCTCAG CTTACATTTG 1066 

AATTCAGTTG TTGTCTATTG GGGCGATGCC ATCAGCCCTT AAGCAATCX3T CTTCATCCCA 1126 

AGGGGGAGGA AGGGATGGTC TTGTGGCAAC TTGGTGTGAC ACTGTCTCCT TAATGAAGTG 1186 

TTTGGAGCTA AGGGAGCCAG TGTTATGGGT GCTGTTTCAC AAGAGGACCC GTTGCACCAT 124 6 

TAAGACACAT GATCCTCCCG TTCCAGGGGT TCTGAGCX3GT CGACTGAGGC AGCTTGCCTG 13 06 

TGGTTAGTTT TTAGGAi^AGG GAGATGTAAG ACTTCCTTGC TTTAGATTTG AAATTATCAC 1366 

AGTTATATTC CATAGAAGAA TTTTTAATTA AAAAAATTTT AGTGGCTAAG CCACTAAACT 14 26 

GGGACCTAAT TGGATGTAGC CTAAGTTACT AATAAGTTCT TAACCAGATC ACCATTTCCA 14 8 6 

ACCACTTAGC CACAGTCACT TGATCCACX3G CCAATCCTTC TGAACTTAAC ATCCTTGTAG 154 6 

TTAGTCACCT TGGGAATTGC TACCTAGATT GTTACCCCCT TCACCTCACT GGTGGCTATC 1606 

ATCAGGTCTA CAGTGACCTG ATCAACAGAC ATGTGCATTA ATTTCTAAAT CACTGCTGTG 1666 

CCTATGATTC AAACCGTCAG CGTGTTCAGT TTATTGATTC TCTCTGAGGT CGGAATTTAT 1726 

TGATTCTCTC TGAGGCTAAG ACATTAAACC TTTACCAAGC AGAGAACGTC CTAACAAGCC 1786 

ACGATAGCCG AACACAGCAT CGATCTCTTC TCTTTTCTGA TGAATACTCA AACTTTCCAA 184 6 

CATATTCTCT TCACAAAAGT AAAGACAGTG AATTTACATC AATCAACX3TT CATGGGTTAA 1906 

AGTCTGCACT GACATTTCCT TGTCTGCCGT TGCATGCCGT TGGCATGCAA GGTGTTAATG 1966 

ACCTGCAACA TGGTGGAGTG CCCTGAACCC TAACTTCCCC AGAGTTGGGA CTGTCTAGTG 2026 

ACCGGCACTG AATAGCAATG CAGGCTGAAG ACCTCCAGGT TTAGAATTTA ACCTCAAAAG 2 086 

TAACTTGTTT TTAAAAAGAA ATGTGAATTA CTGTAAAATA ATCTATTTTT GGATTCGTGT 2146 

GTTTTTCAGG TGGATATAGT TTATAAACAA TGTGAATAAA AAATATTTAA CATGTTTAAA 22 06. 

AAAAAAAAAA AAAAAAAA 2224 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLiOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 117 876 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATCACACGGC TACAGGTGCC TTTATTTCTA CAGTCCGCTG GTGCTGGGAG CGOGCTTGCC 60 
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TTCTCTTGCC TTGAAAGCCT TCTGTCTGGA CCTAGCCACC ACTTGTCTTC ACGGTG 116 

ATG TTG GAG TCC GTA ACC CAT AGC ACC TTC CTG CCC AAC OCA TCC TTC 164 
Met Leu Asp Ser Val Thr His Ser Thr Phe Leu Pro Asn Ala Ser Phe 
15 10 15 

TGT GAG CCC CTG ATG CCT TGG ACC GAT CTG TTC AGC AAT GAA GAC TAG 212 
Cys Asp Pro Leu Met Pro Trp Thr Asp Leu Phe Ser Asn Glu Asp Tyr 
20 25 30 

TAG CCT GCC TTT GAG CAT CAG ACA GCC TGT GAT TCC TAG TGG ACA TCA 260 
Tyr Pro Ala Phe Glu His Gin Thr Ala Cys Asp Ser Tyr Trp Thr Ser 
35 40 45 

GTG CAC CCT GAA TAG TGG ACC AAG CGC CAT GTC TGG GAA TGG CTC CAA 3 08 

Val His Pro Glu Tyr Trp Thr Lys Arg His Val Trp Glu Trp Leu Gin 
50 55 60 

TTC TGG TGT GAC CAG TAG AAG CTT GAT GCC AAC TGG ATG TCC TTC TGT 3 56 

Phe Cys Cys Asp Gin Tyr Lys Leu Asp Ala Asn Cys lie Ser Phe Cys 
65 70 75 80 

CAC TTC AAC ATG AGC GGC CTG CAG CTC TGG AGC ATG ACG CAG GAG GAG 4 04 

His Phe Asn lie Ser Gly Leu Gin Leu Cys Ser Met Thr Gin Glu Glu 
85 90 95 

TTC ATT GAG GCA GCC GGC ATG TGT GGG GAG TAG CTG TAG TTC ATT CTC 452 
Phe lie Glu Ala Ala Gly lie Cys Gly Glu Tyr Leu Tyr Phe lie Leu 
100 105 110 

CAG AAC ATT CGC TGG CAA GGT TAG TCC TTT TTC AAT GAT GCT GAA GAG 500 
Gin Asn lie Arg Ser Gin Gly Tyr Ser Phe Phe Asn Asp Ala Glu Glu 
115 120 125 

ACC AAG ACT GGC ATG AAA GAC TAT GCT GAT TCC AGT TGC TTG AAA ACA 548 
Thr Lys Thr Gly lie Lys Asp Tyr Ala Asp Ser Ser Cys Leu Lys Thr 
130 135 140 

AGT GGC ATG AAG AGT CAA GAC TGT CAC AGC CGA ACA AGC CTC CAA AGT 596 
Ser Gly lie Lys Ser Gin Asp Cys His Ser Arg Thr Ser Leu Gin Ser 
145 150 155 160 

TCT CAC CTG TGG GAA TTT GTC AGA GAC TTG CTG CTG TCC CCT GAA GAG 644 
Ser His Leu Trp Glu Phe Val Arg Asp Leu Leu Leu Ser Pro Glu Glu 
165 170 175 

AAC TGT GGC ATG CTG GAA TGG GAA GAC AGG GAG CAG GGC ATT TTC CGA 692 
Asn Cys Gly lie Leu Glu Trp Glu Asp Arg Glu Gin Gly lie Phe Arg 
180 185 190 

GTG GTT AAG TCA GAA GCC CTG GCA AAG ATG TGG GGA CAA AGG AAG AAG 740 
Val Val Lys Ser Glu Ala Leu Ala Lys Met Trp Gly Gin Arg Lys Lys 
195 200 205 

AAT GAC AGG ATG ACG TAG GAG AAG CTG AGC CGA GCC CTG AGA TAG TAG 788 
Asn Asp Arg Met Thr Tyr Glu Lys Leu Ser Arg Ala Leu Arg Tyr Tyr 
210 215 220 

TAT AAA ACG AGA ATT CTG GAG CGG GTT GAC GGG AGG TTA GTG TAG AAA 836 
Tyr Lys Thr Arg lie Leu Glu Arg Val Asp Arg Arg Leu Val Tyr Lys 
225 230 235 240 

TTT GGA AAG AAC GCG CAC GGG TGG CAG GAA GAG AAA CTC T GATGGACACC 886 
Phe Gly Lys Asn Ala His Gly Trp Gin Glu Glu Lys Leu 
245 250 
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GGACACCAGG 


CTCATTTGAT 


GGATTTCTGT 


TGTTGGAAAC 


AATCAGATCA 


AACTAGACAT 


946 


TTGAAAGTCT 


CCCTCCTCCT 


CCTCCTCCCC 


CTCCTTCCCC 


TCCTCTTCTT 


CCTCCCCCTC 


1006 


CTCCTCTTCA 


AAACCTACAA 


ACACACTGAT 


AAAATTTCTG 


CATGTCTCAG 


CTTACATTTG 


1066 


AATTCAGTTG 


TTGTCTATTG 


GGGCGATGCC 


ATCAGCCCTT 


AAGCAATCGT 


CTTCATCCCA 


1126 


AGGGGGAGGA 


AGGGATGGTC 


TTGTGGCAAC 


TTGGTGTGAC 


ACTGTCTCCT 


TAATGAAGTG 


1186 


TTTGGAGCTA 


AGGGAGCCAG 


TGTTATGGGT 


GCTGTTTCAC 


AAGAGGACCC 


GTTGCACCAT 


1246 


TAAGACACAT 


GATCCTCCCG 


TTCCAGGGGT 


TCTGAGCGGT 


CGACTGAGGC 


AGCTTGCCTG 


1306 


TGGTTAGTTT 


TTAGGAAAGG 


GAGATGTAAG 


ACTTCCTTGC 


TTTAGATTTG 


AAATTATCAC 


1366 


AGTTATATTC 


CATAGAAGAA 


TTTTTAATTA 


AAAAAATTTT 


AGTGGCTAAG 


CCACTAAACT 


1426 


GGGACCTAAT 


TGGATGTAGC 


CTAAGTTACT 


AATAAGTTCT 


TAACCAGATC 


ACCATTTCCA 


1486 


ACCACTTAGC 


CACAGTCACT 


TGATCCACGG 


CCAATCCTTC 


TG 




1528 



(2) INFORMATION FOR SEQ ID NO: 7: 

"(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Leu Asp Ser Val Thr His Ser Thr Phe Leu Pro Asn Ala Ser Phe 
15 10 15 

Cys Asp Pro Leu Met Pro Trp Thr Asp Leu Phe Ser Asn Glu Asp Tyr 
20 25 30 

Tyr Pro Ala Phe Glu His Gin Thr Ala Cys Asp Ser Tyr Trp Thr Ser 
35 40 45 

Val His Pro Glu Tyr Trp Thr Lys Arg His Val Trp Glu Trp Leu Gin 
50 55 60 

Phe Cys Cys Asp Gin Tyr Lys Leu Asp Ala Asn Cys lie Ser Phe Cys 
65 70 75 80 

His Phe Asn lie Ser Gly Leu Gin Leu Cys Ser Met Thr Gin Glu Glu 
85 90 95 

Phe lie Glu Ala Ala Gly lie Cys Gly Glu Tyr Leu Tyr Phe lie Leu 
100 105 110 

Gin Asn lie Arg Ser Gin Gly Tyr Ser Phe Phe Asn Asp Ala Glu Glu 
115 120 125 

Thr Lys Thr Gly lie Lys Asp Tyr Ala Asp Ser Ser Cys Leu Lys Thr 
130 135 140 

Ser Gly lie Lys Ser Gin Asp Cys His Ser Arg Thr Ser Leu Gin Ser 
145 150 155 160 
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Ser His Leu Trp Glu Phe Val Arg 
165 

Asn Cys Gly lie Leu Glu Trp Glu 
180 

Val Val Lys Ser Glu Ala Leu Ala 
195 200 

Asn Asp Arg Met Thr Tyr Glu Lys 
210 215 

Tyr Lys Thr Arg lie Leu Glu Arg 
225 230 

Phe Gly Lys Asn Ala His Gly Trp 
245 



Asp Leu Leu Leu Ser Pro Glu Glu 
170 175 

Asp Arg Glu Gin Gly lie Phe Arg 
185 190 

Lys Met Trp Gly Gin Arg Lys Lys 
205 

Leu Ser Arg Ala Leu Arg Tyr Tyr 
220 

Val Asp Arg Arg Leu Val Tyr Lys 
235 240 

Gin Glu Glu Lys Leu 
250 



(2) INFORMATION -FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GCCAGTCTTG GTCTCTTCAG CATC 24 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AGGAGATGCA GTTGGCATCA AGCT 24 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



AGCCAGTGTT ATGGGTGCTG 



20 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
ACAGTCACTT GATCCACGGC CAATCC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
GGGTGGCAGG AAGACAAGCT ATGA 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
CCAATTAAGT CCCAGCTTGA TGGC 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
CGGGATCCTT GGACTCCGTA ACCCATAGC 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCAGATCTCA GAGTTTCTCT TCCTGCC 27 



1/14 



atcacacggctacaggtgcctttatttctacagtccgctggtgctgggagcgcgct 56 

tgccttctcttgccttgaaagccttctgtctggac 91 

c tag ccaccacttgtcttcacggtg 116 

ATOTTGGACTCCGTAACCCATAGCACCTTCCTGCCCAACGCATCCTTCTGTGACCCCCTG 17 6 

ArGCCTTGGACCGATCTGTTCAGCAATGAAGACTACTACCCTGCCTTTGAGCATCAGACA 236 

GCCTGTGATTCCTACTGGACATCAGTGCACCCTGAATACTGGACCAAGCGCCATGTCTGG 2 96 

GAATGGCTCCAATTCTGCTGTGACCAGTACAAGCTTGATGCCAACTGCATCTCCTTCTGT 356 

CACTTCAACATCAGCGGCCTGCAGCTCTGCAGCATGACGCAGGAGGAGTTCATTGAGGCA 416 

GCCGGCATCTGTGGGGAGTACCTGTACTTCATTCTCCAGAACATTCGCTCGCAAGGTTAC 4 7 6 
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TGCTTGAAAACAAGTGGCATCAAGAGTCAAGACTGTCACAGCCGAACAAGCCTCCAAAGT 596 



TCTCACt:TGTGGGAATTTGTCAGAGACTTGCTGCTGTCCCCTGAAGAGAACTGTGGCATC 656 



CTGGAATGG_GAAGACAGGGAGCAGGGCATTTTCCGAGTGGTTAAGTCAGAAGCCCTGGCA 716 



AAGATGTGGGGACAAAGGAAGAAGAATGACAGGATGACGTACGAGAAGCTGAGCCGAGCC 77 6 



CTGAGATACTACTATAAAACGAGAATTCTGGAGCGGGTTGACCGGAGGTTAGTGTAC/yVA 8 36 
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